Fine tune with SFTTrainer
|
|
17
|
14952
|
September 12, 2024
|
MLflow.js temp check
|
|
0
|
19
|
September 11, 2024
|
Need your help in making the AI Model responses more effective
|
|
0
|
23
|
September 11, 2024
|
What does optimizer_update_8bit function in bitsandbytes.functional actually do with its state1 and state2 parameters?
|
|
0
|
20
|
September 10, 2024
|
Seeking Advice on Processing Support Conversations for Efficient RAG Model Search
|
|
0
|
53
|
September 9, 2024
|
InformationRetrievalEvaluator with training semantic search model
|
|
0
|
277
|
September 6, 2024
|
Rope Factor issues with meta-llama/Meta-Llama-3.1-70B
|
|
3
|
436
|
August 31, 2024
|
How can I dynamically update the system configuration for different users using my demo?
|
|
6
|
31
|
August 28, 2024
|
TGI and turn off Flash Attention v2
|
|
4
|
1938
|
August 23, 2024
|
Study with AI researchers and development team members
|
|
0
|
34
|
August 22, 2024
|
Function Call via HuggingFaceLLM
|
|
1
|
292
|
August 22, 2024
|
+8 Fiverr AI devs (including *Pro* and *Top rated*) couldn't do this apparently easy development, can you?
|
|
2
|
69
|
August 22, 2024
|
Finetuning a pre-trained model
|
|
0
|
63
|
August 21, 2024
|
Include more features per token while training the BERT model
|
|
0
|
14
|
August 21, 2024
|
How to pass table structure to LLM model
|
|
2
|
1596
|
May 1, 2024
|
Why isn't quantization config reducing memory usage?
|
|
0
|
107
|
August 16, 2024
|
Experience with and extending LLM for software engineering
|
|
4
|
541
|
August 15, 2024
|
How to replace the weights of certain layers in a model
|
|
1
|
174
|
August 14, 2024
|
Update different parts of the model with different dataset
|
|
0
|
37
|
August 13, 2024
|
Inference endpoint
|
|
1
|
33
|
August 11, 2024
|
Practicality and Efficiency of Using Non-Power-of-Two Context Lengths in Fine-Tuning Hugging Face Models for SFT or Fine-Tuning
|
|
0
|
20
|
August 8, 2024
|
Multi GPU traning with Accelerator vs Trainer
|
|
2
|
162
|
August 6, 2024
|
Image lost xmp data on uploads
|
|
0
|
32
|
August 5, 2024
|
SFTTrainer for Llama-2
|
|
0
|
106
|
August 3, 2024
|
How to Fine-Tune Phi3-Vision Model with LoRA for Recognizing UI Elements in Images?
|
|
0
|
155
|
August 1, 2024
|
How to correctly freeze some of the Wav2Vec2-Bert’s layers?
|
|
0
|
133
|
July 30, 2024
|
Size Mismatch when loading Lora Adapter for Phi3
|
|
0
|
230
|
July 30, 2024
|
New Merger Development Request
|
|
0
|
33
|
July 29, 2024
|
Classifying text based on intent using bert
|
|
0
|
44
|
July 29, 2024
|
RNN-T predict only blank
|
|
0
|
25
|
July 28, 2024
|