Does generate's max_length influence training?
|
|
0
|
103
|
April 25, 2024
|
Finetuned State-space/mamba model not working on huggingface model
|
|
0
|
107
|
April 25, 2024
|
DPOTrainer consumes lots of VRAM
|
|
0
|
215
|
April 25, 2024
|
Error to import transformers[torch] or accelerate -U
|
|
0
|
389
|
April 25, 2024
|
Prohibitively large RAM consumption on Trainer validation
|
|
2
|
1640
|
April 24, 2024
|
ValueError: Mixed precision training with AMP or APEX (`--fp16`) and FP16 evaluation can only be used on CUDA devices
|
|
9
|
23435
|
April 24, 2024
|
Multiple time fine-tuning VideoMAE model adding n class each time
|
|
0
|
94
|
April 24, 2024
|
How to not show the progress bar for evaluation only?
|
|
1
|
645
|
April 24, 2024
|
How to disable Huggingface Hub during Trainer saving of PEFT models?
|
|
2
|
667
|
April 24, 2024
|
Multivariate time-series transformer
|
|
0
|
190
|
April 24, 2024
|
Why the model loading of llama2 is so slow?
|
|
6
|
9603
|
April 24, 2024
|
Out of bounds Error in label conversion , two labels getting converted to 0 and 247
|
|
0
|
90
|
April 24, 2024
|
How to cluster words into semantic entities, when performing information extraction?
|
|
2
|
1102
|
April 23, 2024
|
Program hangs when creating a transformers.TrainingArguments object
|
|
2
|
437
|
April 23, 2024
|
Unable to open file 'model.bin' in model 'ct2fast_m2m100_418M'
|
|
1
|
1355
|
April 22, 2024
|
Confusing Benchmark results Running whisper on 4080 Super vs A10 vs H100
|
|
0
|
494
|
April 22, 2024
|
How to finetune with a own private data and then build chatbot on that?
|
|
4
|
13850
|
February 16, 2024
|
Conversion from finetune m2m_100 model to huggingface format
|
|
0
|
112
|
April 22, 2024
|
ETA for training time is 60k hours for first generation
|
|
0
|
107
|
April 22, 2024
|
Error after 501 steps
|
|
2
|
513
|
April 22, 2024
|
Issue with Optuna visualization in web browser
|
|
0
|
190
|
April 20, 2024
|
Pipeline device issue, torch_xla generation() bug, flax models malloc errors
|
|
0
|
176
|
April 21, 2024
|
Config parameters for custom models
|
|
0
|
108
|
April 21, 2024
|
Interface API deployment
|
|
0
|
90
|
April 21, 2024
|
Model Parallism
|
|
0
|
186
|
April 21, 2024
|
Fine tuning a sentence-transformer for cosine sim on 500k sentence pairs without labels-- advice
|
|
2
|
1214
|
April 20, 2024
|
I can not download the llama2 model with transformers on gcp
|
|
0
|
160
|
April 19, 2024
|
Batching in "automatic-speech-recognition" pipelines
|
|
2
|
2321
|
April 19, 2024
|
Getting warning message on creation of WeightedLossTrainer object
|
|
1
|
1875
|
April 19, 2024
|
Concatenating in vision transformer
|
|
0
|
133
|
April 19, 2024
|