Topic | Replies | Views | Activity
--- | --- | --- | ---
Fine tune "meta-llama/Llama-2-7b-hf" Bug: RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0! (when checking argument for argument target in method wrapper_CUDA_nll_loss_forward) | 15 | 207 | December 6, 2024
Need a Model for Extracting Relevant Keywords for Given Titles | 1 | 519 | December 6, 2024
Why does moving ML model initialization into a function prevent GPU OOM errors when del, gc.collect(), and torch.cuda.empty_cache() fail? | 0 | 114 | December 5, 2024
Pretrained Models to Heroku Production Environment | 5 | 1835 | July 10, 2020
Searching Keywords by relatively long text | 1 | 686 | December 5, 2024
Computational needs for AI/ML Researchers | 0 | 30 | December 5, 2024
UnicodeDecodeError: 'charmap' codec can't decode byte from load_dataset | 0 | 63 | December 5, 2024
Bad Performance Finetuning Llama Chat and Instruct Models on GSM8K | 5 | 1407 | December 5, 2024
This is for creating an AI model for myself | 0 | 23 | December 5, 2024
Help Making the most logical and rational thinking AI | 0 | 41 | December 5, 2024
Expanding an Audio Dataset with datasets.map()? | 4 | 776 | December 5, 2024
upload_large_folder() issue with uploading to Spaces | 4 | 199 | December 5, 2024
PROBLEM https://github.com/huggingface/transformers.git | 1 | 56 | December 5, 2024
Unsure if correctly loading .from_pretrained models | 0 | 29 | December 4, 2024
LMM fine tuning how to improve training for Question and Answer task? | 0 | 33 | December 4, 2024
Gradio interface suddenly stops functioning and doesn't render properly - Python | 6 | 166 | December 4, 2024
CUDA OOM error when using data-distributed mode on AWS p4d.24xlarge instance | 7 | 351 | December 4, 2024
Get all unique label values in a sorted manner | 2 | 1998 | December 4, 2024
Login issues! Just happened now | 3 | 170 | December 4, 2024
Loading a previous checkpoint in get_peft_model (whisper large-v2) | 0 | 53 | December 4, 2024
Determining the size of logits | 0 | 38 | December 4, 2024
Further Insights on Token Issue with /models Endpoint | 1 | 27 | December 4, 2024
Persistent data is overwritten? | 0 | 23 | December 3, 2024
How to resolve naming conflict between personal account and new organization? | 4 | 159 | December 3, 2024
Using trainer to fine-tune the model gives an error. Seeking solution! | 1 | 115 | December 3, 2024
How should an Absolute Beginner Start learning ML/LLM in 2024 | 6 | 8553 | December 2, 2024
Zephyr tags in response, after fine-tuning | 1 | 46 | December 2, 2024
The memory usage about inference on CPU | 0 | 23 | December 2, 2024
Perfectly the same code, single GPU OK, multi GPU ERROR | 0 | 90 | December 1, 2024
Don't know where to ask | 1 | 173 | December 1, 2024