🤗Transformers

Topic	Replies	Views	Activity
MBART-50 looks not compatible with pipeline 🤗Transformers	0	70	July 10, 2024
Attentions not returned from transformers ViT model when using output_attentions=True 🤗Transformers	4	1014	July 10, 2024
[SOLVED] :Difference between `eval_strategy` and `evaluation_strategy` 🤗Transformers	2	2258	July 10, 2024
Loading a locally saved model is very slow 🤗Transformers	1	3937	July 10, 2024
Mask2Former IoU a lot worse than Maskformer's IoU on same dataset 🤗Transformers	2	511	July 10, 2024
Error when executing pix2struct-widget-captioning-base model 🤗Transformers	6	1307	July 10, 2024
Trainer.evaluate() vs trainer.predict() 🤗Transformers	6	37319	July 10, 2024
Evaluating huggingface transformer with trainer gives different results 🤗Transformers	0	922	March 22, 2023
ValueError: too many values to unpack (expected 3) using the DETR model 🤗Transformers	1	652	July 10, 2024
Deploying inference model size and performance 🤗Transformers	6	5225	July 9, 2024
Tokenize_newline_separately argument unused in PaliGemma processor 🤗Transformers	0	88	July 9, 2024
Mask2Former for Binary segmentation 🤗Transformers	1	547	July 9, 2024
Facing error while adding multiple adapters to a model 🤗Transformers	1	761	July 8, 2024
Greedy sampling with the new branch 🤗Transformers	0	135	July 8, 2024
Expected mat1 and mat2 to have the same dtype, but got: c10::Half != float 🤗Transformers	3	1850	July 8, 2024
Further finetuning a LoRA finetuned CausalLM Model 🤗Transformers	17	11032	July 7, 2024
Evaluate subset of data during training 🤗Transformers	5	5722	July 6, 2024
SFTTrainer takes up so much ram that it breaks an A100 GPU 🤗Transformers	0	215	July 6, 2024
BitsAndBytesConfig is not compitable in TPU env 🤗Transformers	2	246	July 6, 2024
How can I retrain VisualBERT on another dataset? 🤗Transformers	5	187	July 6, 2024
How can I load a locally store model with TGI and Docker? 🤗Transformers	4	1627	July 6, 2024
Different model performance after saving and loading Donut model 🤗Transformers	1	377	July 6, 2024
TypeError: 'NoneType' object does not support item assignment when using Evaluate 🤗Transformers	0	170	July 5, 2024
How can I create AI content Detector 🤗Transformers	0	205	July 5, 2024
Loaded models lost their predictive power 🤗Transformers	3	154	July 5, 2024
GPU memory usage of optimizer's states when using LoRA DeepSpeed	4	816	July 5, 2024
Trainer Config Logging 🤗Transformers	0	87	July 4, 2024
Please save me : GPT like model Generation gone wrong 🤗Transformers	0	55	July 4, 2024
KV Cache Managment 🤗Transformers	0	524	July 4, 2024
Insights generation 🤗Transformers	0	41	July 4, 2024