Error from Notebook
|
|
1
|
14
|
November 23, 2024
|
Is LLaMA rotary embedding implementation correct?
|
|
6
|
5770
|
November 23, 2024
|
Load a large model to multipe, specific GPUs (without CUDA_VISIBLE_DEVICES)
|
|
0
|
62
|
November 22, 2024
|
Class transformers.ReactAgent
|
|
0
|
25
|
November 22, 2024
|
Error in Question answering comput_metrics
|
|
0
|
37
|
November 21, 2024
|
Bert-base for text classification and MLFlow
|
|
3
|
49
|
November 21, 2024
|
Problems when loading checkpoints
|
|
2
|
86
|
November 20, 2024
|
CUDA OOM on first backward pass after evaluation
|
|
0
|
74
|
November 20, 2024
|
N-to-N translation model training/fine-tuning
|
|
0
|
57
|
November 20, 2024
|
For helping doctors! Please help me finetune Phi3 on the following dataset: openlifescienceai/medmcqa
|
|
0
|
44
|
November 20, 2024
|
HFvalidationerror: Repo_id must be in the form repo_name
|
|
5
|
12395
|
September 16, 2024
|
Ethical AI: How Should We Ensure Fairness in NLP Models?
|
|
0
|
118
|
November 19, 2024
|
GPT2: hidden states get by output_hidden_states is different from those by register_forward_hook
|
|
0
|
49
|
November 19, 2024
|
How to set 'max_length' properly when using pipeline?
|
|
4
|
241
|
November 18, 2024
|
CLIPVisionModel Padding Problem
|
|
2
|
114
|
November 18, 2024
|
Issues with Configuring dtype for Local Models in Whisper-Web (Experimental WebGPU)
|
|
0
|
248
|
November 17, 2024
|
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0!
|
|
28
|
97549
|
November 17, 2024
|
Filtering Sampled for Sequence Length
|
|
0
|
54
|
November 16, 2024
|
Can not reproduce result when finetune BERT model
|
|
3
|
184
|
November 16, 2024
|
Trainer vs seq2seqtrainer
|
|
4
|
13153
|
November 15, 2024
|
Availability of the 'argilla/notux-chat-ui' model
|
|
1
|
218
|
November 15, 2024
|
Fine-Tuning Strategies: Choosing Between microsoft/mpnet-base and sentence-transformers/all-MiniLM-L6-v2
|
|
2
|
103
|
November 15, 2024
|
How to teach DETR to detect only BBox for one specific object, without classification
|
|
3
|
64
|
November 14, 2024
|
How to use Transformers ViTs with different resolutions like in timm?
|
|
0
|
36
|
November 14, 2024
|
How can I specify `stop_strings` in `generation_config.json`?
|
|
1
|
102
|
November 14, 2024
|
How do I change the classification head of a model?
|
|
31
|
50532
|
November 14, 2024
|
Fine-tuning BERT Model on domain specific language and for classification
|
|
7
|
7930
|
November 14, 2024
|
Is it possible mAP accuracy detr during training?
|
|
1
|
267
|
November 13, 2024
|
CLAP fine-tuning: error raised for is_longer variable and enable_fusion
|
|
3
|
58
|
November 13, 2024
|
SFTTrainer training very slow on GPU. Is this training speed expected?
|
|
2
|
39
|
November 12, 2024
|