[Guide] How I debugged T5 fine-tuning for a medical diagnosis task
|
|
0
|
8
|
August 4, 2025
|
🤗 Transformers This category is for any question related to the Transformers library.
|
|
0
|
4
|
August 4, 2025
|
Prakash Hinduja, Geneva (Swiss) How can I ask effective technical questions on the Hugging Face forum?
|
|
1
|
20
|
August 4, 2025
|
Generate dataset for fine tuning on PDF(s)
|
|
7
|
3415
|
August 3, 2025
|
Attention mechanism
|
|
6
|
59
|
August 2, 2025
|
AttributeError: 'ORTTrainingArguments' object has no attribute 'deepspeed_plugin'
|
|
2
|
503
|
August 2, 2025
|
Why can't transformers be decoupled?
|
|
2
|
19
|
August 1, 2025
|
ValueError: boxes1 must be in [x0, y0, x1, y1] (corner) format
|
|
4
|
115
|
August 1, 2025
|
How to prepare dataset for fine-tuning a VLM (open-vocabulary detection, COCO format)?
|
|
2
|
31
|
August 1, 2025
|
RT-DETRV2 and normalization
|
|
5
|
33
|
July 31, 2025
|
What’s the Best Way to Fine-Tune a Transformer Model on a Custom Dataset Using the Transformers Library?
|
|
1
|
18
|
July 31, 2025
|
Using transformers on Kaggle
|
|
1
|
16
|
July 29, 2025
|
Potential bug in the rt-detr v2 fine tune script
|
|
5
|
284
|
July 29, 2025
|
Ensure the sentence is complete during generation
|
|
6
|
7072
|
July 28, 2025
|
Multi-gpu huggingface training using trl
|
|
1
|
454
|
July 28, 2025
|
Fine-tune Mistral 7B–9B or 24B (bnb 4bit)
|
|
3
|
25
|
July 26, 2025
|
GETTING ERROR >> AttributeError: 'InferenceClient' object has no attribute 'post'
|
|
17
|
1267
|
July 26, 2025
|
RuntimeError: CUDA error: named symbol not found when using TorchAoConfig with Qwen2.5-VL-7B-Instruct model
|
|
5
|
35
|
July 24, 2025
|
Evaluation step very slow
|
|
2
|
870
|
July 24, 2025
|
Human pose estimation models
|
|
2
|
991
|
July 24, 2025
|
Continued pretraining of Llama 3-8b on a new language
|
|
1
|
45
|
July 23, 2025
|
Webhook usecase
|
|
0
|
4
|
July 23, 2025
|
As of transformers v4.44, default chat template is no longer allowed
|
|
3
|
4217
|
July 23, 2025
|
Proper way of saving/loading models for complex workflows
|
|
2
|
43
|
July 22, 2025
|
Cannot import name 'Wav2Vec2Processor'
|
|
2
|
22
|
July 22, 2025
|
ImportError: cannot import name '_expand_mask' from 'transformers.models.bloom.modeling_bloom'
|
|
2
|
1413
|
July 21, 2025
|
Timeout Issue with DeepSpeed on Multiple GPUs
|
|
2
|
564
|
July 21, 2025
|
InformerForPrediction [I would like to seek your opinions, everyone]
|
|
0
|
7
|
July 20, 2025
|
I was excited about the D-FINE model, but I have got ABYSMAL Results
|
|
3
|
118
|
July 19, 2025
|
How to use a data collator when dealing with text and images
|
|
2
|
1134
|
July 17, 2025
|