Best model for translating English to Japanese
|
|
4
|
1744
|
December 22, 2024
|
No Biases for Llama-3.2-3B-Instruct
|
|
0
|
38
|
December 22, 2024
|
RT-DETR attention map dimension - PekingU/rtdetr_r50vd
|
|
0
|
32
|
December 20, 2024
|
Chatbot PDF - using flan-t5-large model
|
|
0
|
63
|
December 20, 2024
|
How to build a custom question-answering head?
|
|
3
|
1951
|
December 19, 2024
|
How to long to get access to Paligemma 2 gated repo
|
|
1
|
78
|
December 19, 2024
|
How to get access gated repo
|
|
1
|
76
|
December 19, 2024
|
Why there are way more merges then vocabulary tokens in llama-3 model tokenizer?
|
|
1
|
131
|
December 18, 2024
|
CUDA convert GUFF to CUDA GUFF
|
|
6
|
97
|
December 18, 2024
|
Repetitive words in model output
|
|
1
|
42
|
December 18, 2024
|
Confused in no of ResNet blocks in up blocks and no of channels for Unet2D model of diffusers
|
|
0
|
40
|
December 18, 2024
|
Qwen 2.5 coder 7b can't use correct separators
|
|
1
|
94
|
December 16, 2024
|
How do you use Beam Search in Whisper correctly?
|
|
3
|
761
|
December 15, 2024
|
Your request to access this repo has been successfully submitted, and is pending a review from the repo's authors
|
|
11
|
34396
|
December 15, 2024
|
Model crashing with a 1.6 MB txt file?
|
|
0
|
15
|
December 14, 2024
|
Datacamp course token error
|
|
0
|
13
|
December 13, 2024
|
Whisper fine-tuning and retaining timestamp decoding
|
|
5
|
1249
|
December 12, 2024
|
AI disappointment: Why Llama 3.2 (3b version) loses out to Chat-GPT - An analysis of the limitations of Llama 3.2 (3b version) compared to Chat-GPT
|
|
5
|
2616
|
December 12, 2024
|
Fine-Tuned unsloth/Qwen2.5-1.5B Model Generating Unexpected Exclamation Marks
|
|
3
|
248
|
December 10, 2024
|
LLM models to train Aspect-based Sentiment Analysis in German Language
|
|
0
|
62
|
December 9, 2024
|
Recursion in LLM's
|
|
4
|
199
|
December 9, 2024
|
Why does PALIGemma use 256 tokens for a 224x224 image
|
|
0
|
27
|
December 8, 2024
|
I need some recommendation or advice on a fast vqa (visual question answering) model. I really don't know how to look for them
|
|
0
|
69
|
December 7, 2024
|
Looking for a Tiny LLM (max 1.5GB) – Need Advice
|
|
6
|
5185
|
December 6, 2024
|
Using detr with custom backbone
|
|
3
|
564
|
December 6, 2024
|
Pretraining T5 from scratch using MLM
|
|
1
|
371
|
December 6, 2024
|
Why the memory usage is higher than expected when loading nvidia/NV-Embed-v2 model with FP16 precision?
|
|
0
|
87
|
December 6, 2024
|
Can we run custom quantized llama3-8b on Npu?
|
|
0
|
54
|
December 6, 2024
|
Make 5 minute video and speech from text story
|
|
0
|
59
|
December 5, 2024
|
Checkout pre-trained models from ClearerVoice-Studio
|
|
0
|
50
|
December 4, 2024
|