How to rewrite this code?
|
|
0
|
50
|
July 11, 2024
|
MBART-50 looks not compatible with pipeline
|
|
0
|
67
|
July 10, 2024
|
Attentions not returned from transformers ViT model when using output_attentions=True
|
|
4
|
801
|
July 10, 2024
|
[SOLVED] :Difference between `eval_strategy` and `evaluation_strategy`
|
|
2
|
1252
|
July 10, 2024
|
Loading a locally saved model is very slow
|
|
1
|
3628
|
July 10, 2024
|
Mask2Former IoU a lot worse than Maskformer's IoU on same dataset
|
|
2
|
465
|
July 10, 2024
|
Error when executing pix2struct-widget-captioning-base model
|
|
6
|
1292
|
July 10, 2024
|
Trainer.evaluate() vs trainer.predict()
|
|
6
|
35917
|
July 10, 2024
|
Evaluating huggingface transformer with trainer gives different results
|
|
0
|
911
|
March 22, 2023
|
ValueError: too many values to unpack (expected 3) using the DETR model
|
|
1
|
643
|
July 10, 2024
|
Deploying inference model size and performance
|
|
6
|
5168
|
July 9, 2024
|
Tokenize_newline_separately argument unused in PaliGemma processor
|
|
0
|
87
|
July 9, 2024
|
Mask2Former for Binary segmentation
|
|
1
|
519
|
July 9, 2024
|
Facing error while adding multiple adapters to a model
|
|
1
|
607
|
July 8, 2024
|
Greedy sampling with the new branch
|
|
0
|
130
|
July 8, 2024
|
Expected mat1 and mat2 to have the same dtype, but got: c10::Half != float
|
|
3
|
1682
|
July 8, 2024
|
Further finetuning a LoRA finetuned CausalLM Model
|
|
17
|
10524
|
July 7, 2024
|
Evaluate subset of data during training
|
|
5
|
5486
|
July 6, 2024
|
SFTTrainer takes up so much ram that it breaks an A100 GPU
|
|
0
|
193
|
July 6, 2024
|
BitsAndBytesConfig is not compitable in TPU env
|
|
2
|
233
|
July 6, 2024
|
How can I retrain VisualBERT on another dataset?
|
|
5
|
184
|
July 6, 2024
|
How can I load a locally store model with TGI and Docker?
|
|
4
|
1501
|
July 6, 2024
|
Different model performance after saving and loading Donut model
|
|
1
|
340
|
July 6, 2024
|
TypeError: 'NoneType' object does not support item assignment when using Evaluate
|
|
0
|
158
|
July 5, 2024
|
How can I create AI content Detector
|
|
0
|
185
|
July 5, 2024
|
Loaded models lost their predictive power
|
|
3
|
151
|
July 5, 2024
|
GPU memory usage of optimizer's states when using LoRA
|
|
4
|
696
|
July 5, 2024
|
Trainer Config Logging
|
|
0
|
82
|
July 4, 2024
|
Please save me : GPT like model Generation gone wrong
|
|
0
|
54
|
July 4, 2024
|
KV Cache Managment
|
|
0
|
493
|
July 4, 2024
|