SFTTrainer too slow during the build (or ingestion) phase
|
|
0
|
99
|
November 27, 2024
|
How to correct use loss function from Pytorch
|
|
0
|
112
|
November 26, 2024
|
Continuous Memory Usage increasing
|
|
0
|
84
|
November 26, 2024
|
All GPUs at 100% except GPU0 at 0%?
|
|
0
|
31
|
November 25, 2024
|
New Version of PPOTrainer
|
|
6
|
477
|
November 24, 2024
|
Error from Notebook
|
|
1
|
17
|
November 23, 2024
|
Load a large model to multipe, specific GPUs (without CUDA_VISIBLE_DEVICES)
|
|
0
|
184
|
November 22, 2024
|
Class transformers.ReactAgent
|
|
0
|
34
|
November 22, 2024
|
Error in Question answering comput_metrics
|
|
0
|
49
|
November 21, 2024
|
Bert-base for text classification and MLFlow
|
|
3
|
256
|
November 21, 2024
|
Problems when loading checkpoints
|
|
2
|
435
|
November 20, 2024
|
CUDA OOM on first backward pass after evaluation
|
|
0
|
283
|
November 20, 2024
|
N-to-N translation model training/fine-tuning
|
|
0
|
66
|
November 20, 2024
|
For helping doctors! Please help me finetune Phi3 on the following dataset: openlifescienceai/medmcqa
|
|
0
|
47
|
November 20, 2024
|
Ethical AI: How Should We Ensure Fairness in NLP Models?
|
|
0
|
143
|
November 19, 2024
|
GPT2: hidden states get by output_hidden_states is different from those by register_forward_hook
|
|
0
|
98
|
November 19, 2024
|
How to set 'max_length' properly when using pipeline?
|
|
4
|
1754
|
November 18, 2024
|
CLIPVisionModel Padding Problem
|
|
2
|
166
|
November 18, 2024
|
Issues with Configuring dtype for Local Models in Whisper-Web (Experimental WebGPU)
|
|
0
|
304
|
November 17, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0!
|
|
28
|
114880
|
November 17, 2024
|
Filtering Sampled for Sequence Length
|
|
0
|
64
|
November 16, 2024
|
Can not reproduce result when finetune BERT model
|
|
3
|
288
|
November 16, 2024
|
Trainer vs seq2seqtrainer
|
|
4
|
15451
|
November 15, 2024
|
Availability of the 'argilla/notux-chat-ui' model
|
|
1
|
269
|
November 15, 2024
|
Fine-Tuning Strategies: Choosing Between microsoft/mpnet-base and sentence-transformers/all-MiniLM-L6-v2
|
|
2
|
606
|
November 15, 2024
|
How to teach DETR to detect only BBox for one specific object, without classification
|
|
3
|
197
|
November 14, 2024
|
How to use Transformers ViTs with different resolutions like in timm?
|
|
0
|
81
|
November 14, 2024
|
How can I specify `stop_strings` in `generation_config.json`?
|
|
1
|
551
|
November 14, 2024
|
How do I change the classification head of a model?
|
|
31
|
53231
|
November 14, 2024
|
Fine-tuning BERT Model on domain specific language and for classification
|
|
7
|
8475
|
November 14, 2024
|