|
Attentions not returned from transformers ViT model when using output_attentions=True
|
|
5
|
1201
|
March 2, 2026
|
|
Wave Field LLM — O(n log n) attention via wave equation dynamics, within 5% of standard transformer
|
|
2
|
3042
|
March 2, 2026
|
|
Looking for full-precision (non-GGUF) 128k models for LoRA fine-tuning!
|
|
1
|
5
|
March 3, 2026
|
|
Why are gradient_checkpointing and training bound?
|
|
1
|
6
|
March 2, 2026
|
|
Resonant Intelligence: Born Rule Fusion of Large Language Model Ensembles — Quantum-Inspired Interference for LLM Ensembles
|
|
0
|
10
|
March 2, 2026
|
|
What if the first 20% of fine-tuning steps ran on CPU? Train loss dropped 22.5% — and I can't explain why
|
|
2
|
28
|
March 1, 2026
|
|
AI Explained Why LLMs Suddenly “Understand"
|
|
3
|
97
|
March 1, 2026
|
|
Random invites after setting organization email domain
|
|
0
|
4
|
March 2, 2026
|
|
Gradio Chat of my Ai-Inclusive novel - looking for beta testers
|
|
9
|
77
|
March 2, 2026
|
|
RAM usage, Model streaming or alternatives
|
|
4
|
83
|
March 1, 2026
|
|
Issue with summarization and translation pipeline
|
|
3
|
11
|
March 2, 2026
|
|
Using hyperparameter-search in Trainer
|
|
102
|
38887
|
March 2, 2026
|
|
About traning LoRa for Z Image Turbo
|
|
1
|
20
|
March 1, 2026
|
|
Improve meta on rich links to HF papers
|
|
1
|
14
|
March 2, 2026
|
|
CfC‑based hallucination detector and dataset‑composition experiments (math‑ratio → stability, accuracy, hallucinations) — reproducible code + plots
|
|
0
|
8
|
March 1, 2026
|
|
Python Gradio Space stuck in "restarting"
|
|
0
|
6
|
March 1, 2026
|
|
Fail to claim authorship of the paper
|
|
46
|
714
|
March 3, 2026
|
|
Unable to purchase pro plan
|
|
0
|
6
|
March 1, 2026
|
|
Docker Spaces stuck in "Building" with empty logs - all Docker builds affected
|
|
1
|
11
|
February 28, 2026
|
|
An AI streaming "buddy" like Neuro-sama
|
|
5
|
78
|
March 3, 2026
|
|
RLVR for code execution prediction
|
|
1
|
20
|
February 27, 2026
|
|
Dualist - Othello AI - Feedback
|
|
2
|
10
|
March 1, 2026
|
|
I'm new here. I created a space, but it just stays on "Building." Even new spaces aren't loading
|
|
2
|
21
|
February 27, 2026
|
|
Seeking Advice: Qwen3.5-27B failing on Inference Endpoints — is Unsloth GGUF a viable alternative for text editing?
|
|
1
|
43
|
February 28, 2026
|
|
I Built a Persona Library to Assign Expert Roles to Your Prompts
|
|
0
|
21
|
March 1, 2026
|
|
Share and make a dataset of Youtube videos publicly available with a link in research paper
|
|
3
|
22
|
February 26, 2026
|
|
Since Gradio 6.4 -> 6.8.0 Custom Domain doesnt work
|
|
1
|
18
|
February 28, 2026
|
|
Newbie needs help with returned data type (is not an image)
|
|
4
|
29
|
February 26, 2026
|
|
SKA Explorer with an interactive UI
|
|
0
|
11
|
February 28, 2026
|
|
Olive vs optimum
|
|
1
|
11
|
February 27, 2026
|