Announcing ConvaiCausalLM: A Foundational Hindi Causal Language Model (102M)(YAHH! SMALL)
|
|
0
|
14
|
April 28, 2025
|
:wink: Please tell me the top 3 models on the market for chatting WITHOUT censorship
|
|
1
|
431
|
December 24, 2024
|
Qwen 'padding_side = right' problem
|
|
2
|
1158
|
April 25, 2025
|
Docling image captioning best VLM
|
|
2
|
172
|
April 25, 2025
|
A Scroll for Emergent Alignment — Public Law, Not Proposal
|
|
0
|
5
|
April 25, 2025
|
Use hugging face models
|
|
1
|
143
|
April 24, 2025
|
Need Help on New Model Deployed in HuggingFace
|
|
1
|
20
|
April 24, 2025
|
How to make a model file for Ollama?
|
|
1
|
285
|
April 24, 2025
|
I need help,Please give me the best advice
|
|
1
|
21
|
April 24, 2025
|
Discrepancy Between Theoretical and Measured FLOPs/token for LLaMA-4 Scout 17B (MoE)
|
|
0
|
76
|
April 23, 2025
|
LORA Adapated Deepseek R1 not working with inference endpoints
|
|
2
|
76
|
April 22, 2025
|
OutOfMemoryError: CUDA out of memory(LLM) Tuning
|
|
1
|
82
|
April 22, 2025
|
What if Claude becomes more than Claude?
|
|
5
|
20
|
April 21, 2025
|
mistralai/Mistral-7B-v0.1 is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'
|
|
3
|
2351
|
April 21, 2025
|
What do you think of Collaboration between LLMs?
|
|
2
|
18
|
April 21, 2025
|
How to achieve data crawling and large model integration?
|
|
0
|
28
|
April 18, 2025
|
Stability diffusion large turbo started giving distorted images
|
|
2
|
52
|
April 18, 2025
|
Best model to read codes from small torn paper snippets (OCR)
|
|
1
|
82
|
April 17, 2025
|
Padding Token Missing from LLaMA
|
|
1
|
249
|
April 17, 2025
|
How much VRAM and how many GPUs to fine-tune a 70B parameter model like LLaMA 3.1 locally?
|
|
1
|
482
|
April 17, 2025
|
How to Finetune Llama 3 8b Instruct on new Indian legal laws
|
|
1
|
42
|
April 16, 2025
|
Martin Kratt (Wild Kratts) Model request
|
|
0
|
8
|
April 16, 2025
|
404 - "{\"error\":\"Model XLabs-AI/flux-RealismLora does not exist\"}"
|
|
9
|
398
|
April 16, 2025
|
This discussion is about troubleshooting a "Not Found" error when using the Hugging Face Inference API with the google/gemma-3-27b-it model for image and text-based requests.
|
|
1
|
122
|
April 16, 2025
|
The model mistralai/Mistral-7B-Instruct-v0.1 is too large to be loaded automatically (14GB > 10GB)
|
|
2
|
193
|
April 15, 2025
|
How to train FLUX.1 for custom emoji generation — dataset size, script, and deployment?
|
|
1
|
51
|
April 8, 2025
|
Finetuning BioGPT: Encountering Out of Memory error during evaluation
|
|
1
|
49
|
April 15, 2025
|
Improving Sentence Embeddings
|
|
0
|
33
|
April 14, 2025
|
How is the data shifted by one token during CausalLM fine tuning
|
|
4
|
3248
|
April 14, 2025
|
Inference Widget “Model doesn’t exist” error for public model (was working before)
|
|
2
|
60
|
April 14, 2025
|