PEFT fine-tuning Mistral-7B-Instruct-v0.2 - Warning messages
|
|
0
|
507
|
April 19, 2024
|
Transformers CausalLM loss is always nan
|
|
0
|
187
|
April 18, 2024
|
Error when increasing max_length for tokenizer - OverflowError: out of range integral type conversion attempted
|
|
0
|
513
|
April 18, 2024
|
Ref_model in DPOTrainer
|
|
0
|
170
|
April 18, 2024
|
Gather Input tensor at index 1 has invalid shape
|
|
1
|
801
|
April 18, 2024
|
Module error not found: "torch.utils._pytree"
|
|
1
|
3036
|
April 17, 2024
|
Why is Trainer single-threaded during "Generating split..."?
|
|
0
|
297
|
April 17, 2024
|
Generating Once for 16 Tokens is Not Same Generating Single Token 16 Times?
|
|
4
|
286
|
April 17, 2024
|
Setting weights as adapter weights
|
|
0
|
118
|
April 17, 2024
|
Would PyTorch's FSDP work with a model loaded using device_map='auto'?
|
|
0
|
259
|
April 17, 2024
|
Triaging cudaErrorIllegalAddress Error
|
|
2
|
1577
|
April 17, 2024
|
Trainer freezes/crashes after evaluation step
|
|
6
|
1678
|
April 16, 2024
|
How to Modify LLaMA 2 Model for Internal Token Generation Timing
|
|
0
|
268
|
April 16, 2024
|
How Labelled Data is Processed | Transformers Trainer
|
|
10
|
4502
|
April 16, 2024
|
Need help to reduce CLIP image embedding time
|
|
0
|
192
|
April 15, 2024
|
Custom config error when model.save_pretrained
|
|
3
|
2161
|
April 15, 2024
|
CLIP scores, with vector input rather than image input
|
|
0
|
266
|
April 15, 2024
|
Early_stopping_patience param in EarlyStoppingCallback
|
|
2
|
3345
|
April 15, 2024
|
Invalid Key Error when Training GPT2 from Scratch using trainer.train()
|
|
3
|
1537
|
April 15, 2024
|
Can I use "AutoModel For Sequence Classification" class for generative models?
|
|
2
|
753
|
April 15, 2024
|
Looking for exploratory study / best practices for LoRA adapters config (LLM fine-tuning)
|
|
0
|
377
|
April 15, 2024
|
Access feature in custom compute_loss method
|
|
0
|
194
|
April 15, 2024
|
Trocr Model not utilising gpu even I am specified that
|
|
0
|
326
|
April 15, 2024
|
Import transformers fails; installation issue?
|
|
1
|
1993
|
April 15, 2024
|
Hugging face course enough for understanding transformer and llm stuff
|
|
2
|
187
|
March 10, 2024
|
When I try to use my fine-tuned Causal LM model to inference a prompt, I get nothing but the last word repeated multiple times
|
|
1
|
537
|
April 14, 2024
|
Solving error for mismatch tensor size
|
|
0
|
329
|
April 14, 2024
|
Padding options for LayoutLM processor
|
|
0
|
145
|
April 14, 2024
|
Help with Sparse LLM Implementation
|
|
0
|
205
|
April 14, 2024
|
Model for image regression
|
|
0
|
213
|
April 13, 2024
|