Intermediate

Topic	Replies	Views	Activity
Ccreate continues set of generated images - same style and characters	0	158	May 20, 2024
What Model and approach should i use for my use case	2	164	May 20, 2024
Build pretrained huggingface whisper for tensorrt-llm	0	324	May 20, 2024
How to use `inputs_embed` and `attention_mask` together?	1	957	May 19, 2024
What is the correct way to parse data for DPO? Do you seperate out prompt or not?	0	212	May 19, 2024
GPU memory usage is twice (2x) what I calculated based on number of parameters and floating point precision	5	457	May 18, 2024
Change saving metric in Trainer	2	1485	May 18, 2024
Huggingface token returning an invalid token	1	1453	May 17, 2024
Can't change max_input_length of Text Generation Inference	0	137	May 15, 2024
Questions about Mistral and apply_chat_template with Text Generation Inference, openai API and messages API	0	175	May 15, 2024
Question regarding adding a 4080 (and 3080?) to a 4090 rig for AI	2	488	May 15, 2024
Getting nan while fine tuning Blip 2 and weired output	0	148	May 14, 2024
Push model to hugging face hub without Trainer	7	1416	May 14, 2024
Implement few-shot inference for question-answering with DistilBERT	0	156	May 13, 2024
Negative KL-divergence RLHF implementation	1	1595	May 13, 2024
Llama2 tools instruction wierd reponse	2	163	May 8, 2024
Adding another head to Vision encoder decoder model	4	343	May 7, 2024
Quantize a Model before loading it for pre-training?	0	134	May 7, 2024
GPT-2 Data Preparation for Parsing Trees	0	124	May 6, 2024
Tokenizer from a GGUF file in Python?	1	844	May 6, 2024
Train modell for Question Answering	3	320	May 6, 2024
Stochastic Sampling with Trainer.evaluate() Logits	3	328	May 6, 2024
Trouble loading checkpoint shards for microsoft/Phi-3-mini-4k-instruct	1	1097	May 5, 2024
CUDA OOM on model(inputs) but not on model.generate(inputs), but doesn't generate use model(inputs)?	4	256	May 4, 2024
Giving attention mask to ppo_trainer	0	241	May 4, 2024
Less Trainable Parameters after quantization	14	4474	May 2, 2024
Fine tuning RoBerta got an unexpected keyword argument 'labels'	2	1049	May 1, 2024
Fine-tuning with Different Model Heads	4	811	April 30, 2024
What are the limits on saving private models and datasets on the hub?	4	1539	April 29, 2024
Finetuning T5 for Summarisation - Poor results	1	545	April 28, 2024