Fine tune with SFTTrainer | 5 | 2043 | March 15, 2024
Less Trainable Parameters after quantization | 12 | 2447 | March 15, 2024
Best Model for Question + Answer Embeddings | 0 | 107 | March 15, 2024
Transforming Pushed Hugging Face Models into Usable GGUF Models for Local Colab Use | 2 | 519 | March 15, 2024
Fine tuning LLM for text classification -- error with SFTTrainer | 0 | 346 | March 15, 2024
Loss and results misunderstanding | 0 | 78 | March 14, 2024
NLP Training data | 0 | 73 | March 14, 2024
Accessing AltCLIP text encoder | 0 | 66 | March 14, 2024
Training Instruct-pix-2-pix with my own dataset: torch error | 0 | 151 | March 13, 2024
How to train an already finetuned LLM(LLama2)? | 0 | 159 | March 13, 2024
HfHubHTTPError: 502 Server Error: Bad Gateway for url: https://api-inference.huggingface.co/models/HuggingFaceH4/zephyr-7b-beta | 0 | 117 | March 13, 2024
Evaluating RAG only with open-source | 0 | 202 | March 12, 2024
Transformer vs Sentence-Transformer for text classification | 0 | 205 | March 12, 2024
LayoutLMv3 Inference | 2 | 576 | March 11, 2024
How to use DeepSparse in Transformer? | 1 | 189 | March 11, 2024
LLM or NLP project idea for final year | 0 | 245 | March 8, 2024
Same seed across different gpus in multiple workers | 0 | 138 | March 8, 2024
Multi-gpu batch processing fails when using Peft Lora with Huggingface | 1 | 927 | March 8, 2024
The Correct Attention Mask For Examples Packing | 4 | 1035 | March 8, 2024
Model Performance and Sanity check | 0 | 122 | March 7, 2024
Use RAGAS with huggingface LLM | 2 | 734 | March 7, 2024
12% into epoch training loss drops to 0.0 | 2 | 241 | March 6, 2024
Seq2Seq Learning rate | 2 | 194 | March 6, 2024
Patent toolkit latency Issue | 0 | 76 | March 4, 2024
MaskFormer Jagged Edges Issues of output masks | 0 | 87 | March 3, 2024
Difference between model.generate() and model() outputs | 2 | 1128 | March 3, 2024
Tokenizer causes TRL completion data collator failure | 0 | 158 | March 3, 2024
Retraining peft model | 3 | 2290 | March 1, 2024
How language of the prompt impacts on model performance | 0 | 79 | February 29, 2024
Gradio Error: UndefinedError: 'str object' has no attribute 'role' | 1 | 398 | February 29, 2024