🤗Transformers

Topic	Replies	Views	Activity
ValueError when using PatchTSTForClassification 🤗Transformers	1	138	May 28, 2024
"No space left on device" when using model.push_to_hub() 🤗Transformers	1	328	May 28, 2024
Why GPU not be use in Evaluation of Trainer 🤗Transformers	0	95	May 28, 2024
LayoutLMV3 inference without label 🤗Transformers	0	97	May 28, 2024
Llama-2 output from forward function is nonsense, `.generate()` is okay 🤗Transformers	3	1017	May 27, 2024
Finetuning in 8bit with Custom Head 🤗Transformers	0	137	May 27, 2024
Torchrun uses more vram than running the script with python directly 🤗Transformers	1	340	May 27, 2024
How can you switch between adapters in the inference model? 🤗Transformers	2	378	May 27, 2024
Need Help Improving Similarity Scores for Follow-up Detection Using BERT or similar 🤗Transformers	1	113	May 26, 2024
Fine tuning t5 to write like me 🤗Transformers	0	163	May 26, 2024
Is it possible to get the data that is seen by the model during training? 🤗Transformers	1	124	May 26, 2024
Parallelize Mistral/ llama2 output 🤗Transformers	1	153	May 25, 2024
Decision Transformer a question about the tutorial 🤗Transformers	0	127	April 15, 2024
Understanding the Decision Transformer 🤗Transformers	0	136	May 25, 2024
How to get the loss from the Trainer class? 🤗Transformers	0	159	May 25, 2024
Modify the model input format in a .tflite file generated by the run_image_classification.py script 🤗Transformers	0	106	May 24, 2024
How to get list of downloaded models names? 🤗Transformers	6	4604	May 24, 2024
Mistral load_in_8bit slow inference 🤗Transformers	0	237	May 24, 2024
Perplexity Calculation in run_clm.py 🤗Transformers	0	257	May 23, 2024
Can I dynamically add or remove LoRA weights in the transformer library like diffusers 🤗Transformers	3	875	May 23, 2024
Is it possible to generate more than one token when using a decoder only model via forward pass? 🤗Transformers	1	586	May 23, 2024
Trainer RuntimeError: The size of tensor a (462) must match the size of tensor b (448) at non-singleton dimension 1 🤗Transformers	17	44224	May 23, 2024
ValueError: too many values to unpack (expected 2) or not enough values to unpack (expected 2, got 1). T5ForConditionalGeneration 🤗Transformers	0	170	May 23, 2024
T5 tokenizer / ideal method of calculating max_sequence_length? 🤗Transformers	1	539	May 22, 2024
Pass input_embed to WhisperDecoder 🤗Transformers	0	82	May 22, 2024
How to fix ValueError: The model did not return a loss from the inputs? 🤗Transformers	1	600	May 22, 2024
Transformers.js went wrong during the model construction 🤗Transformers	0	448	May 21, 2024
System RAM gets full in sometime and ( VideoMAE ) training job is killed 🤗Transformers	0	65	May 21, 2024
What data batch does SFTTrainer looks at when resumed training 🤗Transformers	0	105	May 21, 2024
TypeError: LlamaForCausalLM.__init__() got an unexpected keyword argument 'load_in_4bit' 🤗Transformers	7	20134	October 7, 2023