Help in model training strategies (PEFT/LORA + RAG)
|
|
0
|
120
|
November 2, 2024
|
How do I correct this message in Replicate and HF
|
|
0
|
79
|
November 1, 2024
|
Advices for learning AI, ML, DL
|
|
4
|
179
|
November 2, 2024
|
Caching issues with MarianMT
|
|
0
|
23
|
November 1, 2024
|
Should I Be Here?
|
|
5
|
908
|
November 1, 2024
|
How to transition from linguistic prompt engineering to NLP/ML/FT
|
|
1
|
591
|
November 1, 2024
|
Webui-user bat file gives errors with Torch
|
|
8
|
562
|
November 1, 2024
|
Error using SFTTrainer: Make sure that your dataset has enough samples to at least yield one packed sequence
|
|
9
|
3033
|
November 1, 2024
|
How modify UI of `gr.Interface`?
|
|
3
|
69
|
November 1, 2024
|
How do I resume training a finetuned model from the epoch it has ended
|
|
3
|
993
|
October 31, 2024
|
Great news! Time to generate music!
|
|
1
|
1065
|
October 31, 2024
|
How to get embedding matrix of bert in hugging face
|
|
8
|
41192
|
October 31, 2024
|
Lama 3.23b performs great when I download and use using ollama but when I manually download the model or if I use the gguf model by unsloth, it gives me irrelevant response. Please help me out
|
|
9
|
1409
|
October 31, 2024
|
Shall I let users to add / delete some gr.Textbox in ui?
|
|
2
|
19
|
October 31, 2024
|
How shall make logon ui better?
|
|
8
|
23
|
October 31, 2024
|
Trainer.train() hangs with multiple GPUs (but GPUs show activity)
|
|
4
|
913
|
October 31, 2024
|
Is IterableDataset automatically reshuffled after each epoch in Trainer?
|
|
0
|
125
|
October 31, 2024
|
How to have independent cache in spaces
|
|
7
|
79
|
October 30, 2024
|
Error when running eval on Mamba LORA with PEFT
|
|
3
|
113
|
October 30, 2024
|
Exception encountered: Unrecognized keyword arguments: ['batch_shape']
|
|
7
|
476
|
October 30, 2024
|
Embedding Assistant on my website
|
|
3
|
1150
|
October 29, 2024
|
Outputs.hidden_states[0][-1] always returns the same logit regardless of the question
|
|
0
|
42
|
October 29, 2024
|
Which model to start my project?
|
|
3
|
77
|
October 29, 2024
|
Speech recognition max length
|
|
2
|
122
|
October 29, 2024
|
Unable to inference from space in python using API
|
|
4
|
142
|
October 29, 2024
|
Local Installation Autotrain Token Issue
|
|
1
|
221
|
October 29, 2024
|
Building email responder model
|
|
0
|
52
|
October 29, 2024
|
Data problem for live support for my e-commerce site
|
|
0
|
18
|
October 28, 2024
|
Dataloader with streaming dataset for image captioning (BLIP finetune)
|
|
1
|
21
|
October 28, 2024
|
How to use model from civitai
|
|
3
|
1273
|
October 28, 2024
|