Topic | Replies | Views | Activity
Which is actually used to configure scheduler in deepspeed and TrainingArguments? | 0 | 93 | May 17, 2024
How to control the GPU id for loading model weights when fintune Llama8B model with the Trainer? | 0 | 76 | May 17, 2024
Cannot import name 'WhisperForAudioClassification | 0 | 160 | May 16, 2024
Isn't KV cache influenced by position encoding in inference? | 3 | 927 | May 16, 2024
ModuleNotFoundError: No module named 'transformers.agents' | 2 | 749 | May 16, 2024
Why follow Flan-T5 template when T5 tokenizer ignores multiple newlines | 0 | 114 | May 15, 2024
Decoder only model - how to have it not include the prompt in its output? | 3 | 664 | May 15, 2024
ValueError: Unrecognized configuration class <class 'transformers.models.whisper.configuration_whisper.WhisperConfig'> | 0 | 245 | May 15, 2024
KeyError: 'eval_qwk' when used get_peft_model | 0 | 112 | May 14, 2024
Fedrated Learning using trainer | 0 | 72 | May 14, 2024
Next sentence prediction on custom model | 3 | 3399 | May 14, 2024
Llama 3 tokenizer prints cryptic message | 0 | 158 | May 13, 2024
Whisper Inference RuntimeError: The expanded size of the tensor (3000) must match the existing size (3392) at non-singleton dimension 1. Target sizes: [80, 3000]. Tensor sizes: [80, 3392] | 1 | 749 | May 13, 2024
HUBERT Implementation with increased vocabulary size | 0 | 87 | May 13, 2024
Model Parralelism approach in Llama Code looks like very inefficient | 0 | 95 | May 13, 2024
ValueError when training on a multi GPU setup and DPO | 0 | 245 | May 13, 2024
Transformer shifting output question | 1 | 355 | May 13, 2024
Not able to add data_collator to Trainer | 1 | 636 | May 13, 2024
How can we automatically run the script with a token included in a script | 0 | 83 | May 13, 2024
Index Error: Target {} is out of bounds | 0 | 267 | May 13, 2024
Load_in_8bit vs. loading 8-bit quantized model | 6 | 6996 | May 13, 2024
Convert Conv1D to nn.Linear | 2 | 981 | May 12, 2024
SFTrainer doesn't show added column | 0 | 101 | May 12, 2024
How can I keep use of the base model version for inference after fine-tuning | 1 | 95 | May 12, 2024
BartForConditionalGeneration: loss function diverges instead of converging | 0 | 123 | May 12, 2024
Beam search error | 2 | 572 | May 12, 2024
An error occurred: You have to specify input_ids | 0 | 308 | May 11, 2024
How to change max_length of a fine tuned model | 4 | 11549 | May 11, 2024
Phi3 Mini 4k Instruct Flash Attention not found | 4 | 5153 | May 11, 2024
LayoutLMv3 inference - bboxes are incorrect | 0 | 120 | May 10, 2024