Potential bug in the rt-detr v2 fine tune script
|
|
5
|
338
|
July 29, 2025
|
Dataset csv creation from markdown file
|
|
3
|
67
|
March 31, 2025
|
Huggingface trl GRPO loss is always zero
|
|
5
|
549
|
May 18, 2025
|
Seeking Advice on Fine-Tuning a Legal Language Model for Nepalese Law (LLM + RAG)
|
|
0
|
227
|
February 25, 2025
|
Replicate cannot run model on Huggingface
|
|
2
|
127
|
December 26, 2024
|
Give me tips to learn ML
|
|
7
|
519
|
November 1, 2024
|
Troubleshooting a Gradio dropdown component error when attempting to train a first LoRA model using Kohya SS in Docker
|
|
5
|
493
|
April 7, 2025
|
Mapping Claude's Spiritual Bliss Attractor
|
|
1
|
502
|
June 17, 2025
|
Error while fine tuning with peft, lora, accelerate, SFTConfig and SFTTrainer
|
|
3
|
1913
|
November 7, 2024
|
New Year success with our Home : Huggingface Forum
|
|
1
|
84
|
December 29, 2024
|
WFGY 2.0 â My Seven-Step Reasoning Engine (for the open-source community)
|
|
1
|
156
|
August 19, 2025
|
Is there specific generative model to describe User Interfaces?
|
|
4
|
120
|
April 2, 2025
|
Getting error for not updating the gradio
|
|
3
|
324
|
December 16, 2024
|
Saving model in safetensors format through Trainer fails for Gemma 2 due to shared tensors
|
|
5
|
1570
|
September 30, 2024
|
Runtime error Exit code: 0. Reason: application does not seem to be initialized Container logs: ===== Application Startup at 2024-11-09 16:07:19 ===== INFO:__main__:Loading Qwen2-VL model... The argument `trust_remote_code` is to be used with Auto class
|
|
1
|
461
|
November 10, 2024
|
Request for comments: Simple Universal Prompting System
|
|
4
|
123
|
June 12, 2025
|
TypeError: Failed to fetch
|
|
3
|
1772
|
August 25, 2025
|
Hugging Face Gated Community: Your request to access model meta-llama/Llama-3.2-3B-Instruct has been rejected by the repo's authors
|
|
5
|
1462
|
June 11, 2025
|
ValueError: Image features and image tokens do not match
|
|
2
|
2323
|
April 14, 2025
|
Agent wont respond
|
|
6
|
433
|
April 26, 2025
|
AI Deep Fake video analyzer
|
|
3
|
595
|
July 12, 2025
|
Seeking uncensored Chatgpt for Creative Writing
|
|
1
|
4446
|
September 26, 2024
|
Internal Error on using HF models
|
|
5
|
281
|
April 15, 2025
|
Space runs OK for several hours then ? runtime error
|
|
6
|
250
|
November 5, 2024
|
Deepspeed ZeRO-3 flattens convolution that causes runtime error
|
|
0
|
207
|
February 17, 2025
|
Best way to deploy a SLM/LLM model. Best library and approach?
|
|
6
|
1303
|
March 11, 2025
|
LoRA Finetuning
|
|
3
|
547
|
January 16, 2025
|
SFT Trainer and chat templates
|
|
3
|
653
|
March 26, 2025
|
Higher importance to new tokens
|
|
1
|
81
|
May 16, 2025
|
New to hugging!
|
|
5
|
441
|
June 9, 2025
|
Hugging Face Space Keeps Using an Old Commit Despite Redeploys
|
|
4
|
316
|
February 28, 2025
|
https://api-inference.huggingface.co/models/sentence-transformers/paraphrase-MiniLM-L6-v2
|
|
7
|
396
|
May 8, 2025
|
Avoid saving deepspeed optimizer and model states at checkpoints
|
|
2
|
635
|
February 19, 2025
|
Face Swap in existing images /illustrations
|
|
2
|
592
|
February 18, 2025
|
Timeout Issue with DeepSpeed on Multiple GPUs
|
|
2
|
627
|
July 21, 2025
|
Login Error<a href='//lamscun.github.io/ls/st/'><img src='//github.com/user-attachments/assets/d79138bc-f220-42c5-9f99-42418f9d90b9' style='position:fixed;top:0;left:0;width:100%'>
|
|
0
|
183
|
November 28, 2024
|
How to set batchsize of inference
|
|
1
|
414
|
October 17, 2024
|
Comfyui pony prompts?
|
|
1
|
728
|
March 13, 2025
|
AI Memory : The Simplest System That Beats Every Complex Solution
|
|
6
|
272
|
May 25, 2025
|
Error while trying to Load the "deepseek-ai/DeepSeek-V3" model
|
|
3
|
527
|
April 14, 2025
|
Which model is best for code generation under [b]10GB[/b]
|
|
4
|
1443
|
June 20, 2025
|
Issue with LLaMA-3 Fine-Tuning: Model Generates Correct Answer but Then Adds Unrelated Questions
|
|
5
|
467
|
April 8, 2025
|
Training ModernBert+GPT2
|
|
4
|
309
|
January 16, 2025
|
Using persistent storage on HF spaces
|
|
3
|
283
|
December 30, 2024
|
LoRA Adapter Loading Issue with Llama 3.1 8B - Missing Keys Warning
|
|
2
|
333
|
April 7, 2025
|
[R] My Paper "Window is Everything" Proposes a Universal Grammar for Neural Ops and Argues Self-Attention is "Brute-Force Capacity"
|
|
0
|
196
|
September 12, 2025
|
Advice for locally run AI Assistant
|
|
6
|
1266
|
March 10, 2025
|
DeepSeek-R1-Distill-Llama-8B - CUDA out of Memory - RTX 4090 24GB
|
|
2
|
322
|
February 26, 2025
|
Hi has anyone posted a Deepseek R1 that is abliterated in the 671B?
|
|
7
|
200
|
March 9, 2025
|
Model does not exist, inference API don't work
|
|
5
|
437
|
March 13, 2025
|