Problems when loading checkpoints
|
|
2
|
71
|
November 20, 2024
|
CUDA OOM on first backward pass after evaluation
|
|
0
|
57
|
November 20, 2024
|
N-to-N translation model training/fine-tuning
|
|
0
|
49
|
November 20, 2024
|
For helping doctors! Please help me finetune Phi3 on the following dataset: openlifescienceai/medmcqa
|
|
0
|
36
|
November 20, 2024
|
HFvalidationerror: Repo_id must be in the form repo_name
|
|
5
|
11492
|
September 16, 2024
|
Ethical AI: How Should We Ensure Fairness in NLP Models?
|
|
0
|
87
|
November 19, 2024
|
GPT2: hidden states get by output_hidden_states is different from those by register_forward_hook
|
|
0
|
48
|
November 19, 2024
|
How to set 'max_length' properly when using pipeline?
|
|
4
|
170
|
November 18, 2024
|
CLIPVisionModel Padding Problem
|
|
2
|
111
|
November 18, 2024
|
Trainer in PEFT doesn't report evaluation metrics
|
|
2
|
138
|
November 18, 2024
|
Issues with Configuring dtype for Local Models in Whisper-Web (Experimental WebGPU)
|
|
0
|
242
|
November 17, 2024
|
Pre-training DeBERTaV2 - config questions
|
|
5
|
1118
|
November 17, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0!
|
|
28
|
95216
|
November 17, 2024
|
Filtering Sampled for Sequence Length
|
|
0
|
51
|
November 16, 2024
|
Can not reproduce result when finetune BERT model
|
|
3
|
178
|
November 16, 2024
|
Trainer vs seq2seqtrainer
|
|
4
|
13008
|
November 15, 2024
|
Availability of the 'argilla/notux-chat-ui' model
|
|
1
|
216
|
November 15, 2024
|
Fine-Tuning Strategies: Choosing Between microsoft/mpnet-base and sentence-transformers/all-MiniLM-L6-v2
|
|
2
|
81
|
November 15, 2024
|
How to teach DETR to detect only BBox for one specific object, without classification
|
|
3
|
56
|
November 14, 2024
|
How to use Transformers ViTs with different resolutions like in timm?
|
|
0
|
35
|
November 14, 2024
|
How can I specify `stop_strings` in `generation_config.json`?
|
|
1
|
68
|
November 14, 2024
|
How do I change the classification head of a model?
|
|
31
|
50217
|
November 14, 2024
|
Fine-tuning BERT Model on domain specific language and for classification
|
|
7
|
7882
|
November 14, 2024
|
New Version of PPOTrainer
|
|
5
|
85
|
November 14, 2024
|
Is it possible mAP accuracy detr during training?
|
|
1
|
248
|
November 13, 2024
|
CLAP fine-tuning: error raised for is_longer variable and enable_fusion
|
|
3
|
42
|
November 13, 2024
|
SFTTrainer training very slow on GPU. Is this training speed expected?
|
|
2
|
28
|
November 12, 2024
|
Compatibility Issue of Transformers Library with TensorFlow 2.18
|
|
2
|
40
|
November 12, 2024
|
Speed issues using tokenizer.train_new_from_iterator on ~50GB dataset
|
|
7
|
1825
|
November 11, 2024
|
T5 variants return Training Loss 0 and Validation loss nan while fine tuning
|
|
8
|
4771
|
November 10, 2024
|