| Topic | Replies | Views | Activity |
|---|---|---|---|
| MBART-50 looks not compatible with pipeline | 0 | 70 | July 10, 2024 |
| Attentions not returned from transformers ViT model when using output_attentions=True | 4 | 1014 | July 10, 2024 |
| [SOLVED]: Difference between `eval_strategy` and `evaluation_strategy` | 2 | 2258 | July 10, 2024 |
| Loading a locally saved model is very slow | 1 | 3937 | July 10, 2024 |
| Mask2Former IoU a lot worse than Maskformer's IoU on same dataset | 2 | 511 | July 10, 2024 |
| Error when executing pix2struct-widget-captioning-base model | 6 | 1307 | July 10, 2024 |
| Trainer.evaluate() vs trainer.predict() | 6 | 37319 | July 10, 2024 |
| Evaluating huggingface transformer with trainer gives different results | 0 | 922 | March 22, 2023 |
| ValueError: too many values to unpack (expected 3) using the DETR model | 1 | 652 | July 10, 2024 |
| Deploying inference model size and performance | 6 | 5225 | July 9, 2024 |
| Tokenize_newline_separately argument unused in PaliGemma processor | 0 | 88 | July 9, 2024 |
| Mask2Former for Binary segmentation | 1 | 547 | July 9, 2024 |
| Facing error while adding multiple adapters to a model | 1 | 761 | July 8, 2024 |
| Greedy sampling with the new branch | 0 | 135 | July 8, 2024 |
| Expected mat1 and mat2 to have the same dtype, but got: c10::Half != float | 3 | 1850 | July 8, 2024 |
| Further finetuning a LoRA finetuned CausalLM Model | 17 | 11032 | July 7, 2024 |
| Evaluate subset of data during training | 5 | 5722 | July 6, 2024 |
| SFTTrainer takes up so much ram that it breaks an A100 GPU | 0 | 215 | July 6, 2024 |
| BitsAndBytesConfig is not compatible in TPU env | 2 | 246 | July 6, 2024 |
| How can I retrain VisualBERT on another dataset? | 5 | 187 | July 6, 2024 |
| How can I load a locally stored model with TGI and Docker? | 4 | 1627 | July 6, 2024 |
| Different model performance after saving and loading Donut model | 1 | 377 | July 6, 2024 |
| TypeError: 'NoneType' object does not support item assignment when using Evaluate | 0 | 170 | July 5, 2024 |
| How can I create AI content Detector | 0 | 205 | July 5, 2024 |
| Loaded models lost their predictive power | 3 | 154 | July 5, 2024 |
| GPU memory usage of optimizer's states when using LoRA | 4 | 816 | July 5, 2024 |
| Trainer Config Logging | 0 | 87 | July 4, 2024 |
| Please save me : GPT like model Generation gone wrong | 0 | 55 | July 4, 2024 |
| KV Cache Management | 0 | 524 | July 4, 2024 |
| Insights generation | 0 | 41 | July 4, 2024 |