How much VRAM and how many GPUs to fine-tune a 70B parameter model like LLaMA 3.1 locally?
|
|
1
|
191
|
April 17, 2025
|
How to Finetune Llama 3 8b Instruct on new Indian legal laws
|
|
1
|
25
|
April 16, 2025
|
Stability diffusion large turbo started giving distorted images
|
|
2
|
45
|
April 18, 2025
|
Gated Repo Access
|
|
2
|
146
|
January 12, 2025
|
Improving Sentence Embeddings
|
|
0
|
32
|
April 14, 2025
|
Link to any model is showing 500 error
|
|
1
|
58
|
April 14, 2025
|
Real-Time Text-to-Speech Model
|
|
2
|
1406
|
January 5, 2025
|
LM STUDIO very week quality with great models
|
|
3
|
687
|
February 9, 2025
|
Merged and Saved model not giving same result after loading
|
|
3
|
79
|
December 27, 2024
|
Training a CausalLM from scratch for a machine translation task
|
|
3
|
72
|
January 10, 2025
|
Make mochi-1 work
|
|
5
|
125
|
November 12, 2024
|
Error 400 Even I have access to model
|
|
3
|
307
|
January 3, 2025
|
How to request mistral:7b-instruct to skip returning context?
|
|
3
|
121
|
February 22, 2025
|
How to increase inference quota
|
|
3
|
44
|
April 4, 2025
|
Which model to use to tell a Playing Card
|
|
1
|
23
|
April 8, 2025
|
Recognising numbers in an image
|
|
3
|
37
|
April 9, 2025
|
Whisper large v3 Finetune result is all lowercased
|
|
2
|
53
|
February 11, 2025
|
YOLOv5 training doesn't work as expected
|
|
1
|
21
|
March 26, 2025
|
OCR Confidence score extraction for OpenGVLab/InternVL2_5-8B-MPO
|
|
2
|
75
|
February 6, 2025
|
Discovering best models for a task
|
|
2
|
210
|
January 26, 2025
|
Creating an Ethical AI SLM
|
|
4
|
65
|
March 6, 2025
|
Fine-Tuned unsloth/Qwen2.5-1.5B Model Generating Unexpected Exclamation Marks
|
|
3
|
311
|
December 10, 2024
|
Fine-tuning flan-t5-small
|
|
2
|
82
|
January 6, 2025
|
RT-DETR with MobileNet
|
|
1
|
51
|
February 20, 2025
|
Seeking Recommendations for an AI Model to Evaluate Photo Damage for Restoration Project
|
|
1
|
127
|
January 21, 2025
|
Why is the Inference API not working for the model I uploaded?
|
|
3
|
162
|
January 18, 2025
|
Can't load huggingface model (Clip) on subprocess
|
|
2
|
195
|
January 19, 2025
|
Can You Legally Use BlaireSilver13/youtube-thumbnail for Commercial Projects?
|
|
1
|
21
|
March 26, 2025
|
Reset model access request in HF
|
|
2
|
85
|
January 7, 2025
|
Looking for an AI model for generating text from a set of predefined words
|
|
1
|
86
|
January 21, 2025
|