Can we run custom quantized llama3-8b on Npu?
|
|
0
|
54
|
December 6, 2024
|
Make 5 minute video and speech from text story
|
|
0
|
59
|
December 5, 2024
|
Checkout pre-trained models from ClearerVoice-Studio
|
|
0
|
50
|
December 4, 2024
|
Qwen 2.5 coder 7b best inference hyperparameters
|
|
0
|
176
|
December 3, 2024
|
What happens if I exceed the number of tokens that fit in the widget?
|
|
0
|
33
|
December 3, 2024
|
Mistral 7b v03 - inference install - wheel error
|
|
3
|
350
|
December 2, 2024
|
Speaker Diarization
|
|
0
|
70
|
December 2, 2024
|
questions on tensor parallelism using pytorch
|
|
0
|
31
|
December 2, 2024
|
500 Internal Server Error when use AsyncInferenceClient
|
|
0
|
27
|
December 2, 2024
|
Download speeds slow on the popular Models
|
|
8
|
3859
|
December 2, 2024
|
What is the best LLM for finetuning with specific repetetive data?
|
|
0
|
101
|
November 30, 2024
|
I need a model that is good at roleplay but with set rules
|
|
1
|
323
|
November 29, 2024
|
SmolVLM 8bit Quantization Problem
|
|
3
|
344
|
November 29, 2024
|
Looks for a good text to video avatar model
|
|
2
|
237
|
November 29, 2024
|
Store LLM Responses in a File for User Download
|
|
0
|
89
|
November 28, 2024
|
Al-Toolkit Guidance required: Training LORA for realistic full-body fashion model portraits
|
|
0
|
825
|
November 27, 2024
|
NSFW for image detection
|
|
0
|
274
|
November 26, 2024
|
Best model to fine-tune for argument (consideration) extraction task?
|
|
2
|
90
|
November 25, 2024
|
Topic : Need a good model that run locally for pdf data extraction
|
|
0
|
65
|
November 24, 2024
|
How to train GPT-2 for text summarization?
|
|
4
|
9464
|
November 24, 2024
|
Memory increasing after hugging face generate method
|
|
0
|
32
|
November 24, 2024
|
{"error":"The expanded size of the tensor (524) must match the existing size (514) at non-singleton dimension 1. Target sizes: [1, 524]. Tensor sizes: [1, 514]"}
|
|
0
|
131
|
November 22, 2024
|
Model downloading speed too low
|
|
4
|
460
|
November 21, 2024
|
Volunteer Needed to help DOCTORS! Volunteer Needed to help DOCTORS! Volunteer Needed to help DOCTORS!
|
|
0
|
48
|
November 21, 2024
|
T5 as Decoder for OCR
|
|
8
|
825
|
November 20, 2024
|
0 Loss. HF trainer integration
|
|
0
|
34
|
November 20, 2024
|
Verify the correctness of implementation of KTO
|
|
0
|
62
|
November 20, 2024
|
Validation Error: Meta-Llama-3-8B-Instruct
|
|
3
|
228
|
November 19, 2024
|
Help my model return the expected data
|
|
2
|
136
|
November 18, 2024
|
Cannot use DETR without downloading backbone
|
|
0
|
85
|
November 18, 2024
|