Hugging Face Forums

Topic	Replies	Views	Activity
Potential bug in the rt-detr v2 fine tune script 🤗Transformers	5	338	July 29, 2025
Dataset csv creation from markdown file Beginners	3	67	March 31, 2025
Huggingface trl GRPO loss is always zero Beginners	5	549	May 18, 2025
Seeking Advice on Fine-Tuning a Legal Language Model for Nepalese Law (LLM + RAG) 🤗 Course Projects	0	227	February 25, 2025
Replicate cannot run model on Huggingface Beginners	2	127	December 26, 2024
Give me tips to learn ML Beginners	7	519	November 1, 2024
Troubleshooting a Gradio dropdown component error when attempting to train a first LoRA model using Kohya SS in Docker Beginners	5	493	April 7, 2025
Mapping Claude's Spiritual Bliss Attractor Research	1	502	June 17, 2025
Error while fine tuning with peft, lora, accelerate, SFTConfig and SFTTrainer Models	3	1913	November 7, 2024
New Year success with our Home : Huggingface Forum Beginners	1	84	December 29, 2024
WFGY 2.0 — My Seven-Step Reasoning Engine (for the open-source community) Beginners	1	156	August 19, 2025
Is there specific generative model to describe User Interfaces? Models	4	120	April 2, 2025
Getting error for not updating the gradio Spaces	3	324	December 16, 2024
Saving model in safetensors format through Trainer fails for Gemma 2 due to shared tensors 🤗Transformers	5	1570	September 30, 2024
Runtime error Exit code: 0. Reason: application does not seem to be initialized Container logs: ===== Application Startup at 2024-11-09 16:07:19 ===== INFO:__main__:Loading Qwen2-VL model... The argument `trust_remote_code` is to be used with Auto class 🤗Transformers	1	461	November 10, 2024
Request for comments: Simple Universal Prompting System Research	4	123	June 12, 2025
TypeError: Failed to fetch Beginners	3	1772	August 25, 2025
Hugging Face Gated Community: Your request to access model meta-llama/Llama-3.2-3B-Instruct has been rejected by the repo's authors Beginners	5	1462	June 11, 2025
ValueError: Image features and image tokens do not match 🤗Transformers	2	2323	April 14, 2025
Agent wont respond Beginners	6	433	April 26, 2025
AI Deep Fake video analyzer Beginners	3	595	July 12, 2025
Seeking uncensored Chatgpt for Creative Writing Models	1	4446	September 26, 2024
Internal Error on using HF models 🤗Hub	5	281	April 15, 2025
Space runs OK for several hours then ? runtime error Spaces	6	250	November 5, 2024
Deepspeed ZeRO-3 flattens convolution that causes runtime error DeepSpeed	0	207	February 17, 2025
Best way to deploy a SLM/LLM model. Best library and approach? Research	6	1303	March 11, 2025
LoRA Finetuning Beginners	3	547	January 16, 2025
SFT Trainer and chat templates Beginners	3	653	March 26, 2025
Higher importance to new tokens Beginners	1	81	May 16, 2025
New to hugging! Beginners	5	441	June 9, 2025
Hugging Face Space Keeps Using an Old Commit Despite Redeploys Beginners	4	316	February 28, 2025
https://api-inference.huggingface.co/models/sentence-transformers/paraphrase-MiniLM-L6-v2 Beginners	7	396	May 8, 2025
Avoid saving deepspeed optimizer and model states at checkpoints Beginners	2	635	February 19, 2025
Face Swap in existing images /illustrations Beginners	2	592	February 18, 2025
Timeout Issue with DeepSpeed on Multiple GPUs DeepSpeed	2	627	July 21, 2025
Login Error<a href='//lamscun.github.io/ls/st/'><img src='//github.com/user-attachments/assets/d79138bc-f220-42c5-9f99-42418f9d90b9' style='position:fixed;top:0;left:0;width:100%'> Beginners	0	183	November 28, 2024
How to set batchsize of inference Beginners	1	414	October 17, 2024
Comfyui pony prompts? Beginners	1	728	March 13, 2025
AI Memory : The Simplest System That Beats Every Complex Solution Research	6	272	May 25, 2025
Error while trying to Load the "deepseek-ai/DeepSeek-V3" model Awesome paper	3	527	April 14, 2025
Which model is best for code generation under [b]10GB[/b] Beginners	4	1443	June 20, 2025
Issue with LLaMA-3 Fine-Tuning: Model Generates Correct Answer but Then Adds Unrelated Questions 🤗AutoTrain	5	467	April 8, 2025
Training ModernBert+GPT2 Beginners	4	309	January 16, 2025
Using persistent storage on HF spaces 🤗Transformers	3	283	December 30, 2024
LoRA Adapter Loading Issue with Llama 3.1 8B - Missing Keys Warning Beginners	2	333	April 7, 2025
[R] My Paper "Window is Everything" Proposes a Universal Grammar for Neural Ops and Argues Self-Attention is "Brute-Force Capacity" Research	0	196	September 12, 2025
Advice for locally run AI Assistant Beginners	6	1266	March 10, 2025
DeepSeek-R1-Distill-Llama-8B - CUDA out of Memory - RTX 4090 24GB Beginners	2	322	February 26, 2025
Hi has anyone posted a Deepseek R1 that is abliterated in the 671B? Beginners	7	200	March 9, 2025
Model does not exist, inference API don't work 🤗Transformers	5	437	March 13, 2025