| Topic | Replies | Views | Activity |
|---|---|---|---|
| Distributed Muon – Reproducibility Dataset (Chrome Traces, Scripts, Figures, Logs) | 0 | 7 | November 30, 2025 |
| GETTING ERROR >> AttributeError: 'InferenceClient' object has no attribute 'post' | 18 | 1969 | November 30, 2025 |
| Cannot upload new arxiv paper | 3 | 37 | November 29, 2025 |
| Interest in Contributing PEFT Educational Resources - Seeking Community Input | 4 | 74 | November 29, 2025 |
| Reproducing & Validating Distributed Muon (MoonshotAI) — Performance & Communication Results | 0 | 6 | November 29, 2025 |
| Notifications Going to Old Email Address - Not Updated | 2 | 296 | November 30, 2025 |
| Model Produces Chaotic / Repetitive Output When `top_k` Is Higher — How to Fix This | 0 | 7 | November 29, 2025 |
| Beyond Correction: Epistemic Safety as a Mediator for Policy Transfer in Large Language Models | 0 | 5 | November 29, 2025 |
| Building AI Agent for DevOps Daily business in IT Company | 1 | 18 | November 29, 2025 |
| Trainer taking a long time to start | 7 | 24 | November 30, 2025 |
| What is the best way to write the mode instruction for an AI? | 4 | 37 | November 28, 2025 |
| Provider="auto" always routes DeepSeek model to Novita even when Novita is disabled — ignoring org settings | 1 | 15 | November 27, 2025 |
| How to understand the special tokens? | 1 | 14 | November 28, 2025 |
| My Master’s research turned into a PyTorch layer that calms down unstable Transformers | 4 | 61 | November 29, 2025 |
| Built an AI that uses block-code & how it works | 3 | 23 | November 28, 2025 |
| I ran a 36B Parameter Model with this compressor file I made | 1 | 39 | November 30, 2025 |
| Fail to claim authorship of the paper | 36 | 426 | November 28, 2025 |
| HF Space stuck at Starting | 3 | 20 | November 28, 2025 |
| Gpt-oss training on A100 - OOM error | 9 | 68 | November 27, 2025 |
| Persistent 'websockets.asyncio' Error in Gradio Space with yfinance | 2 | 20 | November 26, 2025 |
| SOTA Pure Dense Retrieval on BEIR: Beating Hybrid Methods with Nomic Embed v1.5 | 3 | 18 | November 29, 2025 |
| ChatGPT 5 : A open discussion | 13 | 225 | November 25, 2025 |
| Multimodal Prefix Caching with Qwen3-VL | 2 | 47 | November 26, 2025 |
| LLM Course - Chapter 1- How to Properly Run Transformers Notebooks in SageMaker Lab? | 3 | 20 | November 26, 2025 |
| How to download Llama-3.1-8B-Instruct-CoreMl model | 2 | 30 | November 27, 2025 |
| ERROR: failed to push spaces-registry-us.huggingface.tech | 22 | 259 | November 12, 2025 |
| Card forensics using VLM localized prompt approach | 3 | 12 | November 29, 2025 |
| How to Evaluate Fine-Tuned LLMs? | 3 | 19 | November 27, 2025 |
| ContractNLI-based NDA Risk Analyzer using RoBERTa + Chunking – Looking for Feedback | 6 | 36 | November 25, 2025 |
| Struggling to install huggingface_hub | 5 | 66 | November 26, 2025 |