RuntimeError: CUDA error: named symbol not found when using TorchAoConfig with Qwen2.5-VL-7B-Instruct model | 5 | 52 | July 24, 2025
Evaluation step very slow | 2 | 897 | July 24, 2025
Human pose estimation models | 2 | 1017 | July 24, 2025
Continued pretraining of Llama 3-8b on a new language | 1 | 87 | July 23, 2025
Webhook usecase | 0 | 4 | July 23, 2025
As of transformers v4.44, default chat template is no longer allowed | 3 | 4834 | July 23, 2025
Proper way of saving/loading models for complex workflows | 2 | 56 | July 22, 2025
Cannot import name 'Wav2Vec2Processor' | 2 | 58 | July 22, 2025
ImportError: cannot import name '_expand_mask' from 'transformers.models.bloom.modeling_bloom' | 2 | 1475 | July 21, 2025
Timeout Issue with DeepSpeed on Multiple GPUs | 2 | 623 | July 21, 2025
InformerForPrediction [I would like to seek your opinions, everyone] | 0 | 8 | July 20, 2025
I was excited about the D-FINE model, but I have got ABYSMAL Results | 3 | 146 | July 19, 2025
How to use a data collator when dealing with text and images | 2 | 1140 | July 17, 2025
CUDA Out Of Memory when training a DETR Object detection model with compute_metrics | 3 | 121 | July 17, 2025
Function/tool calling using Transformer models | 5 | 1047 | July 17, 2025
Pytorch Language Modeling Example for Seq2Seq Models | 0 | 12 | July 16, 2025
My Spaces remain stuck on “Starting” despite a Pro subscription and GPU hosting | 2 | 46 | July 14, 2025
Fine-tune for function call on Meta-Llama-3.1-8B-Instruct | 6 | 125 | July 15, 2025
Object detection resolution fine-tuning | 1 | 41 | July 14, 2025
deBERTa v3 implementation in HuggingFace (with RTD training) | 5 | 342 | July 12, 2025
Make repo-consistency fails even for intentional tweaks in a copied model | 9 | 44 | July 11, 2025
When Fine-tuning a object detection model which parameters do we update? | 1 | 37 | July 10, 2025
Understanding T5 with custom embedding | 3 | 29 | July 9, 2025
How can I make the LLM to forget the knowledge? | 3 | 69 | July 9, 2025
Can I use a custom attention layer while still leveraging a pre-trained BERT model? | 4 | 27 | July 8, 2025
OneFormer ID/Labels for FineTuning | 13 | 43 | July 8, 2025
How to save the best trial's model using `trainer.hyperparameter_search` | 6 | 2717 | July 8, 2025
This is my fine tuning trocr code why is it not working anyone please help me I really need your help I am working on new language | 9 | 86 | July 8, 2025
Accuracy decreasing after saving/reloading my model | 3 | 10 | July 8, 2025
Java version of transformers library? | 2 | 197 | July 7, 2025