I cannot download any large models stored in xet with Brave or MS Edge for weeks
|
|
2
|
47
|
August 12, 2025
|
Llama 3.2 3B instruct model giving wrong answer
|
|
5
|
606
|
August 12, 2025
|
503 server Error
|
|
1
|
71
|
August 12, 2025
|
Whisper fine-tuning slow eval
|
|
2
|
497
|
August 11, 2025
|
Can't get my uzu model.... got the token but still gated?
|
|
0
|
20
|
August 11, 2025
|
Getting this error while trying to access Mistral model using Streamlit
|
|
1
|
51
|
August 11, 2025
|
Error while initializing ZeroGPU
|
|
13
|
264
|
August 10, 2025
|
Model access denied
|
|
2
|
36
|
August 9, 2025
|
Extend codellama
|
|
2
|
24
|
August 9, 2025
|
Correct way to pass context to llama.cpp server
|
|
2
|
1631
|
August 8, 2025
|
Whisper BaseGerman Awareai (Marksdo) finetuning(training)
|
|
1
|
18
|
August 7, 2025
|
Started a new project- creating a Mamba Hybrid Prototype
|
|
1
|
18
|
August 7, 2025
|
HFLM Python integration way slower than CLI
|
|
4
|
17
|
August 6, 2025
|
Why is context length not displayed?
|
|
5
|
67
|
August 6, 2025
|
Multi-page Document Classification
|
|
4
|
2818
|
August 5, 2025
|
CAS service error when downloading gated models on Databricks even with HF_HUB_DISABLE_XET=1
|
|
15
|
296
|
August 5, 2025
|
Sentence similarity models not capturing opposite sentences
|
|
12
|
4502
|
August 4, 2025
|
Gen 2 Deterministic AI - Tribot-9.98m-micro Public Release
|
|
6
|
52
|
August 4, 2025
|
I've built a LLM pre-processing toolbox and would love to hear your feedback
|
|
1
|
35
|
August 3, 2025
|
Why Do We Settle for Less?
|
|
36
|
412
|
August 2, 2025
|
Systematic rejection for Llama models access - Need review
|
|
3
|
74
|
August 1, 2025
|
Tool/Function calling abilities of LLM's that are used locally pulled through ollama
|
|
2
|
74
|
August 1, 2025
|
Pruned Llama on lm-evaluation-harness
|
|
1
|
185
|
August 1, 2025
|
Issues with expanding capacity of pretrained Qwen2.5-1.5B
|
|
0
|
14
|
August 1, 2025
|
Bug: Granite 4.0 Tiny Preview inference broken
|
|
1
|
17
|
July 30, 2025
|
Looking for Starter-Model Tips: What’s Your Go-To Baseline for the “DeFi-Behavior” Demo?
|
|
0
|
20
|
July 29, 2025
|
Fine tuning and it's effects on model safety
|
|
7
|
84
|
July 28, 2025
|
Is there any difference between GPT-J and GPT-2?
|
|
4
|
2811
|
July 28, 2025
|
Built my own ChatGPT-like bot – needs help with masking
|
|
2
|
104
|
July 27, 2025
|
Identify model requirements in memory and disk
|
|
1
|
56
|
July 26, 2025
|