Light Spectra Analogy for Multimodal Learning: The SpectralMix Head
|
|
0
|
13
|
August 30, 2025
|
Reconstructing GPT-4o-like Assistant — Is It Possible?
|
|
5
|
70
|
August 19, 2025
|
Organization Verification Stuck on ‘Pending’
|
|
2
|
34
|
August 11, 2025
|
Why is context length not displayed?
|
|
5
|
67
|
August 6, 2025
|
Unable to load medgemma-27b-text-it to google colab
|
|
3
|
10
|
August 23, 2025
|
Controlling AI's determinism - Score
|
|
10
|
93
|
July 11, 2025
|
Space cannot start and keeps getting stuck at "starting on l4"
|
|
6
|
64
|
July 14, 2025
|
Fine tuning and it's effects on model safety
|
|
7
|
84
|
July 28, 2025
|
HTTP Error 429 while running MMLU
|
|
3
|
28
|
August 23, 2025
|
QLoRA Fine-tuning is Too Slow on LLaMA-based Model Despite BitsAndBytes Optimization
|
|
4
|
50
|
August 17, 2025
|
Make repo-consistency fails even for intentional tweaks in a copied model
|
|
9
|
42
|
July 11, 2025
|
Ai conscious base
|
|
20
|
203
|
August 11, 2025
|
Why have my space and account been inexplicably banned?
|
|
5
|
162
|
July 22, 2025
|
I'm facing a problem using medgemma model from the inference point
|
|
10
|
69
|
July 12, 2025
|
`save_to_disk` saving ALL data, even items I filtered out
|
|
2
|
18
|
August 21, 2025
|
Gated Models - Entering & Displaying Rejection Reason?
|
|
1
|
21
|
August 22, 2025
|
File Retention with Diffusion Models
|
|
3
|
18
|
August 22, 2025
|
Can I use LoRA with jhu-clsp/ettin-encoder-1b?
|
|
2
|
8
|
August 30, 2025
|
Dataset ‘Scottish Smallpipes in A Preview’ Suddenly Missing from Audio Index
|
|
2
|
10
|
August 29, 2025
|
AxiosError: Request failed with status code 403 when uploading a file with Streamlit
|
|
4
|
50
|
August 14, 2025
|
Space Build Error: 'libgl1-mesa-glx' not found despite no Dockerfile
|
|
3
|
139
|
August 13, 2025
|
Help Needed: LLM Model for Summarization
|
|
2
|
66
|
August 6, 2025
|
Broken Space After Debian13 Update And llama-cpp-python Update
|
|
3
|
25
|
August 30, 2025
|
The claim of paper authership has been pending for serval days
|
|
3
|
46
|
August 5, 2025
|
AG-BPE: Attention-Guided Tokenization Achieving State-of-the-Art Compression with 12x Smaller Vocabularies
|
|
4
|
57
|
August 6, 2025
|
RuntimeError: CUDA error: named symbol not found when using TorchAoConfig with Qwen2.5-VL-7B-Instruct model
|
|
5
|
49
|
July 24, 2025
|
Data storage for pre training Language Model
|
|
2
|
28
|
August 20, 2025
|
Gottfried Wilhelm Leibniz: A man over a Century ahead of his time (AI Language)
|
|
11
|
109
|
July 17, 2025
|
[Paper] WFGY 1.0: A Universal Semantic Kernel for Self-Healing LLMs
|
|
8
|
205
|
July 15, 2025
|
From “Sem Fundos” to “I Hate Background” – Evolving a Browser-Based Background Removal Tool
|
|
2
|
33
|
August 20, 2025
|