🤗Transformers

Topic	Replies	Views	Activity
Trainer predict or evaluate returns zero for metrics 🤗Transformers	0	52	July 11, 2024
Re-initialize decoder parameters of a pretrained model 🤗Transformers	0	60	July 11, 2024
Model is getting loaded unevenly on GPUs 🤗Transformers	1	49	July 11, 2024
Track multiple losses & different outputs size with Trainer and callbacks 🤗Transformers	4	3065	July 11, 2024
How to rewrite this code? 🤗Transformers	0	50	July 11, 2024
MBART-50 looks not compatible with pipeline 🤗Transformers	0	68	July 10, 2024
Attentions not returned from transformers ViT model when using output_attentions=True 🤗Transformers	4	805	July 10, 2024
[SOLVED] :Difference between `eval_strategy` and `evaluation_strategy` 🤗Transformers	2	1291	July 10, 2024
Loading a locally saved model is very slow 🤗Transformers	1	3640	July 10, 2024
Mask2Former IoU a lot worse than Maskformer's IoU on same dataset 🤗Transformers	2	465	July 10, 2024
Error when executing pix2struct-widget-captioning-base model 🤗Transformers	6	1293	July 10, 2024
Trainer.evaluate() vs trainer.predict() 🤗Transformers	6	35989	July 10, 2024
Evaluating huggingface transformer with trainer gives different results 🤗Transformers	0	911	March 22, 2023
ValueError: too many values to unpack (expected 3) using the DETR model 🤗Transformers	1	643	July 10, 2024
Deploying inference model size and performance 🤗Transformers	6	5172	July 9, 2024
Tokenize_newline_separately argument unused in PaliGemma processor 🤗Transformers	0	87	July 9, 2024
Mask2Former for Binary segmentation 🤗Transformers	1	519	July 9, 2024
Facing error while adding multiple adapters to a model 🤗Transformers	1	612	July 8, 2024
Greedy sampling with the new branch 🤗Transformers	0	130	July 8, 2024
Expected mat1 and mat2 to have the same dtype, but got: c10::Half != float 🤗Transformers	3	1694	July 8, 2024
Further finetuning a LoRA finetuned CausalLM Model 🤗Transformers	17	10541	July 7, 2024
Evaluate subset of data during training 🤗Transformers	5	5507	July 6, 2024
SFTTrainer takes up so much ram that it breaks an A100 GPU 🤗Transformers	0	195	July 6, 2024
BitsAndBytesConfig is not compitable in TPU env 🤗Transformers	2	235	July 6, 2024
How can I retrain VisualBERT on another dataset? 🤗Transformers	5	184	July 6, 2024
How can I load a locally store model with TGI and Docker? 🤗Transformers	4	1508	July 6, 2024
Different model performance after saving and loading Donut model 🤗Transformers	1	343	July 6, 2024
TypeError: 'NoneType' object does not support item assignment when using Evaluate 🤗Transformers	0	158	July 5, 2024
How can I create AI content Detector 🤗Transformers	0	186	July 5, 2024
Loaded models lost their predictive power 🤗Transformers	3	151	July 5, 2024