Training Loss 0.0000 and Validation Loss nan
|
|
1
|
89
|
October 3, 2024
|
Windows 11 does not see my 2nd GPU (4090 + 4080)
|
|
2
|
256
|
October 3, 2024
|
Not able to predict using Transformers Trainer class
|
|
2
|
120
|
October 2, 2024
|
How do I finetune Blip2 model on a custom dataset?
|
|
1
|
246
|
October 1, 2024
|
Loading Dataset from Cache Data
|
|
1
|
59
|
September 30, 2024
|
Self correction by model
|
|
7
|
81
|
September 30, 2024
|
Can't locate the error in my dataset
|
|
3
|
141
|
September 30, 2024
|
LayoutLMv3 processor error
|
|
4
|
81
|
September 27, 2024
|
Implementation of NER model with relationship extraction?
|
|
3
|
6346
|
September 25, 2024
|
How to stop LLM from going up to the max token limit?
|
|
1
|
86
|
September 25, 2024
|
Missing files ? Missing config.json File After AutoTrain on Hugging Face
|
|
1
|
73
|
October 15, 2024
|
DPO training data format
|
|
7
|
914
|
September 23, 2024
|
Encoding/decoding NLP model in tensorflow lite (fine-tuned GPT2)
|
|
3
|
1116
|
September 21, 2024
|
Fine tuning with conversation dialog data
|
|
0
|
76
|
September 20, 2024
|
Why does tokenizer.apply_chat_template() add multiple eos tokens?
|
|
4
|
406
|
September 19, 2024
|
Qlora Training with Custom Trainer
|
|
0
|
55
|
September 19, 2024
|
Fuzzy title matching
|
|
0
|
27
|
September 17, 2024
|
FineTuning a CasualLM with a text file
|
|
0
|
86
|
September 17, 2024
|
Developing a cartoon story
|
|
3
|
39
|
September 16, 2024
|
Re-Initialize Trainer object in for loop, does model update itself?
|
|
0
|
20
|
September 15, 2024
|
Runtime error Exit code: 1. Reason: Traceback (most recent call last): File "/home/user/ann/ann.nu" line 3 in modules Logs Build Container Application Startup at 2024-09-15 08:20:37 Traceback (most recent call last): File "/home/user/app/app.py",
|
|
1
|
150
|
September 15, 2024
|
Problem with transformer Trainer with torch CustomDataset, during fine-tuning
|
|
3
|
342
|
September 12, 2024
|
Fine tune with SFTTrainer
|
|
17
|
11955
|
September 12, 2024
|
MLflow.js temp check
|
|
0
|
19
|
September 11, 2024
|
Need your help in making the AI Model responses more effective
|
|
0
|
17
|
September 11, 2024
|
What does optimizer_update_8bit function in bitsandbytes.functional actually do with its state1 and state2 parameters?
|
|
0
|
15
|
September 10, 2024
|
Seeking Advice on Processing Support Conversations for Efficient RAG Model Search
|
|
0
|
38
|
September 9, 2024
|
InformationRetrievalEvaluator with training semantic search model
|
|
0
|
158
|
September 6, 2024
|
Rope Factor issues with meta-llama/Meta-Llama-3.1-70B
|
|
3
|
324
|
August 31, 2024
|
How can I dynamically update the system configuration for different users using my demo?
|
|
6
|
31
|
August 28, 2024
|