Overcoming Overfitting in Transformer Fine-Tuning?
|
|
0
|
466
|
February 29, 2024
|
Wav2Vec Classification on Labeled Data
|
|
0
|
95
|
February 28, 2024
|
Deepspeed trainer and custom loss weights
|
|
1
|
563
|
February 28, 2024
|
CUDA OUT OF MEMORY on MULTI GPU
|
|
0
|
726
|
February 28, 2024
|
Index out of range in self while using the LILT Model
|
|
1
|
273
|
February 25, 2024
|
Llama2 70b - Cuda out of memory exceptions
|
|
0
|
162
|
February 28, 2024
|
Logging & Experiment tracking with W&B
|
|
78
|
45321
|
February 28, 2024
|
Trainer attribute, n_gpu
|
|
0
|
165
|
February 28, 2024
|
Can you fine tune a CausalLM model (GPT2) to seq2seq, redefining the architecture or do I need to retrain the model from scratch?
|
|
0
|
347
|
February 28, 2024
|
Issue loading quantised model
|
|
0
|
278
|
February 28, 2024
|
HuggingFace Transformers Error When Saving Model: TypeError: Object of type method is not JSON serializable
|
|
1
|
2569
|
February 27, 2024
|
Plotting train accuracy and loss with Trainer
|
|
2
|
3331
|
February 27, 2024
|
T5 decoder predicting tokens even after hitting end of sequence token, i.e </s>
|
|
4
|
333
|
February 26, 2024
|
Huggingface tokenizer object has no attribute 'pad'
|
|
1
|
1546
|
February 26, 2024
|
Difference between pipeline and model.generate?
|
|
2
|
2588
|
February 26, 2024
|
Not able to minimize loss during finetuning
|
|
0
|
122
|
February 26, 2024
|
Issue on Kosmos-2 model training on new dataset
|
|
3
|
443
|
February 25, 2024
|
How can I use Inference API with my model?
|
|
0
|
146
|
February 24, 2024
|
Unable to load a saved custom model
|
|
0
|
563
|
February 24, 2024
|
LoRA finetuning without quantization (8bit)
|
|
1
|
985
|
February 23, 2024
|
Pipeline max_length
|
|
2
|
3965
|
February 23, 2024
|
"invalid kernel image" when using HF llama trainer
|
|
1
|
424
|
February 23, 2024
|
Change Transformers version on Huggingface Inference Endpoint?
|
|
0
|
210
|
February 23, 2024
|
Equivalent of limit_val_batches in trainer class
|
|
0
|
117
|
February 22, 2024
|
Changing Hidden size in Clip Text encoder
|
|
0
|
264
|
February 22, 2024
|
Multilabel Audio Classification Training size mismatch
|
|
3
|
372
|
February 22, 2024
|
Xls-r fine tuening does not give the same output when running multiple times
|
|
0
|
83
|
February 22, 2024
|
Unexpected keyword argument 'sampling_strategy' with TrainingArguments
|
|
0
|
1777
|
February 22, 2024
|
Finetune LLM with DeepSpeed
|
|
2
|
5146
|
February 22, 2024
|
DeepSpeed integration for HuggingFace Seq2SeqTrainingArguments
|
|
0
|
1517
|
February 22, 2024
|