LlamaIndex Saving and Loading | 1 | 745 | January 2, 2024
When using an SDXL base and refiner, should LoRAs be sent to both? | 0 | 762 | December 30, 2023
Manual splitting of model across multi-GPU setup | 1 | 4070 | December 29, 2023
Audio classification vs. transcribing and using classifier | 0 | 218 | December 29, 2023
Data Conversion to CoNLL-2003 | 4 | 839 | December 28, 2023
[Solved] Cannot restart training from DeepSpeed checkpoint | 3 | 2716 | December 28, 2023
AWS Serverless Endpoint RAM size issue | 0 | 139 | December 27, 2023
Is iterative training advisable? | 3 | 367 | December 27, 2023
Hugging Face Evaluator | 1 | 163 | December 26, 2023
Why is there no output when an IDEFICS-based model is run on CUDA? | 0 | 572 | December 26, 2023
Modify HF model for training | 1 | 384 | December 22, 2023
Running into CUDA out of memory when running llama2-13b-chat model on multi-GPU machine | 5 | 11107 | December 21, 2023
Function Calling and RAG Features Using Open-Source LLMs | 0 | 807 | December 21, 2023
QLoRA memory requirement with 3B model loads GPU with 10GB of memory with 4bit quantization | 0 | 1179 | December 19, 2023
What infrastructure (compute, network, and storage) will support OpenLLaMA 7B model training, fine-tuning, and inferencing? | 0 | 166 | December 20, 2023
Mapping text that describes connected devices to a JSON object with chosen shape | 2 | 422 | December 19, 2023
Remove PE/Encoder on BartModel | 0 | 198 | December 18, 2023
How to choose optimal batch size for training LLMs? | 4 | 19084 | December 18, 2023
Whisper fine-tuning without Seq2SeqTrainer | 0 | 350 | December 15, 2023
CUDA out of memory issue training Whisper model on single GPU | 0 | 931 | December 15, 2023
Time Series Transformer. Lagged values and time alignment | 4 | 1083 | December 14, 2023
BART - Input format | 4 | 1790 | December 13, 2023
No accuracy of model in AutoTrain | 0 | 191 | December 12, 2023
Saving model per some step when using Trainer | 3 | 9298 | December 11, 2023
I have a question about giving an image condition to diffusion models | 0 | 582 | December 11, 2023
How does generation work with compute_metrics | 0 | 376 | December 9, 2023
Training Fails after multiple passes: ValueError: The model did not return a loss from the inputs | 3 | 4275 | December 9, 2023
Random utf-8 errors from dataset | 10 | 3646 | December 8, 2023
OpenAI AI Assistant Alternative Using Hugging Face Models | 0 | 287 | December 7, 2023
Can I do a DPO training on a synthetic dataset? | 0 | 408 | December 6, 2023