SFTTrainer too slow during the build (or ingestion) phase
|
|
0
|
93
|
November 27, 2024
|
How to correct use loss function from Pytorch
|
|
0
|
110
|
November 26, 2024
|
Continuous Memory Usage increasing
|
|
0
|
76
|
November 26, 2024
|
All GPUs at 100% except GPU0 at 0%?
|
|
0
|
29
|
November 25, 2024
|
New Version of PPOTrainer
|
|
6
|
393
|
November 24, 2024
|
Error from Notebook
|
|
1
|
17
|
November 23, 2024
|
Load a large model to multipe, specific GPUs (without CUDA_VISIBLE_DEVICES)
|
|
0
|
159
|
November 22, 2024
|
Class transformers.ReactAgent
|
|
0
|
33
|
November 22, 2024
|
Error in Question answering comput_metrics
|
|
0
|
42
|
November 21, 2024
|
Bert-base for text classification and MLFlow
|
|
3
|
199
|
November 21, 2024
|
Problems when loading checkpoints
|
|
2
|
335
|
November 20, 2024
|
CUDA OOM on first backward pass after evaluation
|
|
0
|
242
|
November 20, 2024
|
N-to-N translation model training/fine-tuning
|
|
0
|
64
|
November 20, 2024
|
For helping doctors! Please help me finetune Phi3 on the following dataset: openlifescienceai/medmcqa
|
|
0
|
45
|
November 20, 2024
|
Ethical AI: How Should We Ensure Fairness in NLP Models?
|
|
0
|
137
|
November 19, 2024
|
GPT2: hidden states get by output_hidden_states is different from those by register_forward_hook
|
|
0
|
82
|
November 19, 2024
|
How to set 'max_length' properly when using pipeline?
|
|
4
|
1504
|
November 18, 2024
|
CLIPVisionModel Padding Problem
|
|
2
|
151
|
November 18, 2024
|
Issues with Configuring dtype for Local Models in Whisper-Web (Experimental WebGPU)
|
|
0
|
301
|
November 17, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0!
|
|
28
|
112437
|
November 17, 2024
|
Filtering Sampled for Sequence Length
|
|
0
|
63
|
November 16, 2024
|
Can not reproduce result when finetune BERT model
|
|
3
|
248
|
November 16, 2024
|
Trainer vs seq2seqtrainer
|
|
4
|
14866
|
November 15, 2024
|
Availability of the 'argilla/notux-chat-ui' model
|
|
1
|
247
|
November 15, 2024
|
Fine-Tuning Strategies: Choosing Between microsoft/mpnet-base and sentence-transformers/all-MiniLM-L6-v2
|
|
2
|
500
|
November 15, 2024
|
How to teach DETR to detect only BBox for one specific object, without classification
|
|
3
|
169
|
November 14, 2024
|
How to use Transformers ViTs with different resolutions like in timm?
|
|
0
|
68
|
November 14, 2024
|
How can I specify `stop_strings` in `generation_config.json`?
|
|
1
|
473
|
November 14, 2024
|
How do I change the classification head of a model?
|
|
31
|
52737
|
November 14, 2024
|
Fine-tuning BERT Model on domain specific language and for classification
|
|
7
|
8400
|
November 14, 2024
|