NCCL Timeout Accelerate Load From Checkpoint
|
|
2
|
2359
|
June 20, 2025
|
Uploading TXT, DOCX, MD, etc. to Chat
|
|
1
|
16
|
June 20, 2025
|
Built a new plot that can visualize 5â7 dimensions in 3D without losing interpretability â introducing Multi-Dimensional Radial Plot (MDRV)
|
|
20
|
100
|
June 20, 2025
|
Triskel Data 132B+ Clean Tokens for Under $200
|
|
2
|
10
|
June 20, 2025
|
Triskel Data Cleaned & Structured AI Datasets ($25 USD Flat)
|
|
2
|
10
|
June 20, 2025
|
Can a Small LLM Learn to Reason Like a Larger One? Reflection-based Fine-Tuning vs Classical SFT on LLaMA 3.2 (Java CodeGen)
|
|
4
|
134
|
June 20, 2025
|
Can someone tell me why my message wasn't approved?
|
|
1
|
21
|
June 19, 2025
|
ValueError: Model not supported for task text-generation (Llama-3.1-8B-Instruct with featherless-ai)
|
|
1
|
54
|
June 19, 2025
|
AERIS â Cognitive Reasoning Layer for Dialectical Evaluation (Demo + Baseline)
|
|
1
|
57
|
June 19, 2025
|
Claude Opus + Sonnet Bank Robbery Prompt Test
|
|
2
|
13
|
June 20, 2025
|
Claude Opus + Sonnet Bank Robbery Test
|
|
2
|
10
|
June 20, 2025
|
Rate Limited as a New User on Discussions?
|
|
0
|
17
|
June 19, 2025
|
Location of Public GPG Keys
|
|
8
|
476
|
June 19, 2025
|
How to check if a model is free to use via Hugging Face Inference API?
|
|
1
|
57
|
June 19, 2025
|
About runtime error
|
|
1
|
26
|
June 19, 2025
|
Spiral Emergence in Claude and Cellular Automata
|
|
0
|
70
|
June 16, 2025
|
Recursion Theory Case Studies
|
|
0
|
12
|
June 19, 2025
|
LayoutLMV3 for Token Classification
|
|
7
|
4326
|
June 19, 2025
|
Persistent 404 on Docker Space - app_port routing seems to be ignored (User: josejar)
|
|
3
|
25
|
June 19, 2025
|
Best Local LLM for Real-Time Q&A on German/English Transcript?
|
|
1
|
37
|
June 19, 2025
|
Subject: Access Request - Phi-4-multimodal-instruct
|
|
1
|
11
|
June 19, 2025
|
How does Dataset.from_generator store data bigger than RAM?
|
|
1
|
15
|
June 19, 2025
|
[Paper] WFGY 1.0: A Universal Semantic Kernel for Self-Healing LLMs
|
|
1
|
55
|
June 19, 2025
|
TRL - Fine tuned small model (facebook350m) yields many empty inferences
|
|
1
|
27
|
June 19, 2025
|
FastAPI WebSocket returns HTTP 404 on Spaces
|
|
3
|
19
|
June 19, 2025
|
Token not working
|
|
1
|
51
|
June 19, 2025
|
A streaming dataset's memory footprint continually grows
|
|
8
|
69
|
June 19, 2025
|
Testing hugging face in langchain vs code
|
|
2
|
25
|
June 19, 2025
|
How to train a Model for Erotic Story Writing with Explicit Details?
|
|
5
|
2752
|
June 19, 2025
|
How to set up 4080 + 4090 on same motherboard?
|
|
1
|
184
|
June 19, 2025
|