Loss spike when resuming from FSDP SHARDED_STATE_DICT checkpoint (possible optimizer-state mismatch)
|
|
1
|
12
|
June 28, 2025
|
How to decode CSM tokens into audio tensors for streaming
|
|
1
|
13
|
June 23, 2025
|
Model does not exist while running chat-ui through docker
|
|
1
|
13
|
June 22, 2025
|
Subject: Access Request - Phi-4-multimodal-instruct
|
|
1
|
11
|
June 19, 2025
|
Double charge for Pro plan without activation – need support
|
|
1
|
12
|
June 10, 2025
|
Question about using OpenCV on Hugging Face Spaces (Licensing/Legality)
|
|
1
|
12
|
June 7, 2025
|
Dario Schiraldi : Can I use chatbots to automate customer service?
|
|
1
|
11
|
June 2, 2025
|
AttributeError: 'CustomQwen3Model' object has no attribute 'config'
|
|
1
|
12
|
May 16, 2025
|
Page jump issue
|
|
1
|
11
|
May 12, 2025
|
Advice on tech stack
|
|
0
|
32
|
September 12, 2024
|
Account unable to login error
|
|
0
|
28
|
September 1, 2024
|
What is the command to clone llma3 model?
|
|
0
|
27
|
August 30, 2024
|
Pytorch 2.2.0 release of AWS deep learning containers
|
|
0
|
29
|
August 19, 2024
|
Accelerate natively compatible with datasets
|
|
0
|
30
|
July 19, 2024
|
Autotrain nvidia dgx cloud not working
|
|
0
|
35
|
July 17, 2024
|
Image Regression (multivalue)
|
|
0
|
29
|
July 16, 2024
|
I need help,Please give me the best advice
|
|
1
|
20
|
April 24, 2025
|
Training a model, sharing a link
|
|
1
|
20
|
October 16, 2024
|
[CALL FOR PARTICIPANTS] Dissertation Research
|
|
0
|
45
|
March 31, 2025
|
[ Open Source] Notate - A Desktop Application Combining Reasoning, Agents, VectorStorage, Local deployment and more!
|
|
0
|
26
|
January 28, 2025
|
Can't get an SSL secure Uvicorn server
|
|
0
|
28
|
September 5, 2024
|
Eval_loss error on evaluation at the first epoch
|
|
0
|
33
|
August 28, 2024
|
Updating model and tokenizers inside Trainer.train
|
|
0
|
34
|
August 23, 2024
|
Downloading clip extremely takes long time
|
|
0
|
27
|
August 19, 2024
|
Dynamically resizing input for Huggingface's generate()
|
|
0
|
31
|
August 2, 2024
|
Mistral or LLaMA for hardware design bot?
|
|
0
|
26
|
July 24, 2024
|
Saving Fine-tune Falcon Model
|
|
0
|
36
|
July 15, 2024
|
Looking for feedback on an early-stage customizable AI agent platform
|
|
1
|
18
|
May 11, 2025
|
Help me!!Where is the data of o1-1217 in the DeepSeek paper that I have never found?
|
|
1
|
20
|
April 9, 2025
|
Apollo Video Understanding Models
|
|
1
|
11
|
July 4, 2025
|
AutoTrain Advanced Error on Massive Dataset
|
|
1
|
10
|
July 2, 2025
|
How can I tell what each dataset was used for?
|
|
1
|
11
|
June 30, 2025
|
Where can I find wildchat-50m judgement data documentation?
|
|
1
|
11
|
June 27, 2025
|
Lost files in a 3 month old reo
|
|
1
|
11
|
June 25, 2025
|
I cant access deepsite. its asking me to login but showing 404 error
|
|
1
|
10
|
June 22, 2025
|
What’s the best open-source model to generate safe, structured, and empathetic rehab prompts for Parkinson’s patients in a LangChain-based PT coaching agent (starting with Mistral, Zephyr, or something healthcare-tuned)?
|
|
1
|
10
|
June 18, 2025
|
Copyright policy regarding youtube datasets
|
|
1
|
10
|
June 12, 2025
|
Error in installing pip install -e ".[dev]"
|
|
1
|
12
|
June 5, 2025
|
I am getting this error initiating Livebook
|
|
1
|
11
|
May 26, 2025
|
Why does chunked dataset training give different results compared to full-batch training in my Siren model?
|
|
1
|
14
|
February 20, 2025
|
Job Title: Tier 1 AI/Data Engineer - LegalTech Startup (Rapid 2-Month Impact)
|
|
2
|
39
|
March 6, 2025
|
Questions about training bert with two columns data
|
|
0
|
28
|
September 21, 2024
|
Regarding Diffusion for Audio
|
|
0
|
25
|
September 5, 2024
|
Simplifying AI Model Discovery with a Mobile App: Each Model as a Virtual Person
|
|
0
|
26
|
September 2, 2024
|
How to Fine-Tune mBART or mT5 for Transliteration from Romanized Text to Native Script?
|
|
0
|
25
|
August 26, 2024
|
Nobody so far as I know is pushing the utility potential of LLMs: Some ideas
|
|
0
|
25
|
August 16, 2024
|
Colab load text-to-image model cause oom
|
|
0
|
30
|
August 4, 2024
|
DatasetInfo seems to be missing when I pull my dataset from HFHub
|
|
0
|
27
|
July 17, 2024
|
Text classification of RSS articles
|
|
3
|
12
|
July 4, 2025
|
Cant import packages.txt
|
|
2
|
8
|
June 17, 2025
|