|
Resource requirement to run DeepSeek R1 7B
|
|
1
|
54
|
May 9, 2025
|
|
So now I need a dedicated endpoint to test most models? (only 34k out of 1.6 million supported)
|
|
0
|
17
|
May 8, 2025
|
|
404 downloading models
|
|
3
|
73
|
May 7, 2025
|
|
Can I get clarification on what exactly transformers does vs what the model does?
|
|
4
|
74
|
May 6, 2025
|
|
How to combine Image and Text embedding for product similarity
|
|
2
|
17957
|
May 6, 2025
|
|
Related to Claude Model
|
|
1
|
43
|
May 5, 2025
|
|
Cannot export tflite using optimum for a fine-tuned gemma 3 model for task : question answering
|
|
6
|
265
|
May 5, 2025
|
|
Inquiry Regarding Out of Memory Issue During LoRA Fine-Tuning
|
|
2
|
440
|
May 5, 2025
|
|
Text 2 Video -> Wan2_1-T2V-1_3B_fp32
|
|
0
|
43
|
May 2, 2025
|
|
Gemma3TextModel weights
|
|
1
|
58
|
May 2, 2025
|
|
Error in loading Llama model
|
|
0
|
56
|
April 30, 2025
|
|
Best model for translating English to Japanese
|
|
7
|
4096
|
April 29, 2025
|
|
Announcing ConvaiCausalLM: A Foundational Hindi Causal Language Model (102M)(YAHH! SMALL)
|
|
0
|
25
|
April 28, 2025
|
|
:wink: Please tell me the top 3 models on the market for chatting WITHOUT censorship
|
|
1
|
543
|
December 24, 2024
|
|
Qwen 'padding_side = right' problem
|
|
2
|
1439
|
April 25, 2025
|
|
Docling image captioning best VLM
|
|
2
|
216
|
April 25, 2025
|
|
A Scroll for Emergent Alignment — Public Law, Not Proposal
|
|
0
|
10
|
April 25, 2025
|
|
Use hugging face models
|
|
1
|
169
|
April 24, 2025
|
|
Need Help on New Model Deployed in HuggingFace
|
|
1
|
28
|
April 24, 2025
|
|
How to make a model file for Ollama?
|
|
1
|
399
|
April 24, 2025
|
|
I need help,Please give me the best advice
|
|
1
|
28
|
April 24, 2025
|
|
Discrepancy Between Theoretical and Measured FLOPs/token for LLaMA-4 Scout 17B (MoE)
|
|
0
|
101
|
April 23, 2025
|
|
LORA Adapated Deepseek R1 not working with inference endpoints
|
|
2
|
87
|
April 22, 2025
|
|
OutOfMemoryError: CUDA out of memory(LLM) Tuning
|
|
1
|
89
|
April 22, 2025
|
|
What if Claude becomes more than Claude?
|
|
5
|
21
|
April 21, 2025
|
|
mistralai/Mistral-7B-v0.1 is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
|
|
3
|
2438
|
April 21, 2025
|
|
What do you think of Collaboration between LLMs?
|
|
2
|
35
|
April 21, 2025
|
|
How to achieve data crawling and large model integration?
|
|
0
|
32
|
April 18, 2025
|
|
Stability diffusion large turbo started giving distorted images
|
|
2
|
72
|
April 18, 2025
|
|
Best model to read codes from small torn paper snippets (OCR)
|
|
1
|
100
|
April 17, 2025
|