|
Started a new project- creating a Mamba Hybrid Prototype
|
|
1
|
37
|
August 7, 2025
|
|
HFLM Python integration way slower than CLI
|
|
4
|
35
|
August 6, 2025
|
|
Why is context length not displayed?
|
|
5
|
90
|
August 6, 2025
|
|
Multi-page Document Classification
|
|
4
|
2972
|
August 5, 2025
|
|
CAS service error when downloading gated models on Databricks even with HF_HUB_DISABLE_XET=1
|
|
15
|
921
|
August 5, 2025
|
|
Sentence similarity models not capturing opposite sentences
|
|
12
|
4547
|
August 4, 2025
|
|
Gen 2 Deterministic AI - Tribot-9.98m-micro Public Release
|
|
6
|
79
|
August 4, 2025
|
|
I've built a LLM pre-processing toolbox and would love to hear your feedback
|
|
1
|
40
|
August 3, 2025
|
|
Why Do We Settle for Less?
|
|
36
|
440
|
August 2, 2025
|
|
Systematic rejection for Llama models access - Need review
|
|
3
|
98
|
August 1, 2025
|
|
Tool/Function calling abilities of LLM's that are used locally pulled through ollama
|
|
2
|
132
|
August 1, 2025
|
|
Pruned Llama on lm-evaluation-harness
|
|
1
|
201
|
August 1, 2025
|
|
Issues with expanding capacity of pretrained Qwen2.5-1.5B
|
|
0
|
20
|
August 1, 2025
|
|
Bug: Granite 4.0 Tiny Preview inference broken
|
|
1
|
42
|
July 30, 2025
|
|
Looking for Starter-Model Tips: What’s Your Go-To Baseline for the “DeFi-Behavior” Demo?
|
|
0
|
22
|
July 29, 2025
|
|
Fine tuning and it's effects on model safety
|
|
7
|
161
|
July 28, 2025
|
|
Is there any difference between GPT-J and GPT-2?
|
|
4
|
2846
|
July 28, 2025
|
|
Built my own ChatGPT-like bot – needs help with masking
|
|
2
|
112
|
July 27, 2025
|
|
Identify model requirements in memory and disk
|
|
1
|
90
|
July 26, 2025
|
|
Model huggingfaceh4/zephyr-7b-alpha is not supported for task text-generation
|
|
1
|
199
|
July 24, 2025
|
|
Traditional monitoring falls short for AI in production — what are you using instead?
|
|
0
|
19
|
July 24, 2025
|
|
Your request to access this repo has been successfully submitted, and is pending a review from the repo's authors
|
|
12
|
36313
|
July 23, 2025
|
|
Preparing audio and transcripts for fine-tuning Whisper
|
|
3
|
57
|
July 22, 2025
|
|
Dall E Mini clogged since 5 days
|
|
1
|
46
|
July 22, 2025
|
|
What architecture enables pose-consistent, photorealistic virtual outfit changes?
|
|
1
|
61
|
July 22, 2025
|
|
Best model to fine-tune for code explanation and debugging assistant (zero-cost deployment goal)
|
|
2
|
311
|
July 22, 2025
|
|
API returning 404 Not Found for Text Models, but Image Models work
|
|
1
|
74
|
July 22, 2025
|
|
How long does image generation with black-forest-labs/FLUX.1-dev take?
|
|
4
|
117
|
July 22, 2025
|
|
Is this possible?
|
|
2
|
135
|
July 21, 2025
|
|
Call for Participation in Academic Research on AI in Project Management
|
|
0
|
33
|
July 18, 2025
|