|
About the Models category
|
|
0
|
4405
|
August 12, 2020
|
|
Finetuning T5 problems
|
|
10
|
97
|
January 13, 2026
|
|
A Bidirectional LLM Firewall: Next Level X1 - help wanted!
|
|
16
|
77
|
January 13, 2026
|
|
In RAG systems, who's really responsible for hallucination... the model, the retriever, or the data?
|
|
4
|
176
|
January 11, 2026
|
|
I need a model for requirements extraction
|
|
6
|
894
|
January 10, 2026
|
|
Fine-tuning GR00T for Chess Setup on Unitree G1
|
|
2
|
17
|
January 10, 2026
|
|
Task="text2text-generation" and model="google/flan-t5-(base or large)" fails to generate testcases from description
|
|
2
|
22
|
January 8, 2026
|
|
InferenceClient image_to_image example fails for Qwen/Qwen-Image-Edit-2511 with fal-ai provider
|
|
3
|
28
|
January 8, 2026
|
|
Multiple answers for a context
|
|
8
|
57
|
January 6, 2026
|
|
I cannot seem to run any workflow on runpod on comfyui
|
|
3
|
73
|
January 5, 2026
|
|
“How do you preserve agent state across restarts?”
|
|
2
|
60
|
January 3, 2026
|
|
HF document not works when try to deploy on Sagemaker
|
|
3
|
21
|
January 2, 2026
|
|
Inquiry About 120s Timeout on Hugging Face Inference Endpoint for Llama 3.1-8B
|
|
3
|
123
|
December 30, 2025
|
|
Designing multi-agent pipelines with shared state — how are you approaching it?
|
|
2
|
45
|
December 25, 2025
|
|
I injected a physics engine into Llama-3-8B. It hallucinated its way to the right answer
|
|
3
|
162
|
December 24, 2025
|
|
Why Kayra-1 exists: a small Turkish model experiment
|
|
3
|
40
|
December 18, 2025
|
|
Small Decoder-only model < 1B parameters
|
|
3
|
322
|
December 16, 2025
|
|
Gemini System Prompt Extraction: AlphaTool Policy Analysis & Genesis Protocol Multi‑Agent Alternative
|
|
0
|
129
|
December 16, 2025
|
|
Natural Language to T-Sql issue: sqlcoder-7b-2 fails on complex T-SQL joins & date logic (offline, 40GB GPU)
|
|
1
|
20
|
December 16, 2025
|
|
AI Agent - How to create
|
|
9
|
3983
|
December 14, 2025
|
|
Best architecture for fine-grained Arabic Twitter hate speech classification (small/imbalanced dataset)
|
|
2
|
21
|
December 13, 2025
|
|
What is the best model for Marketing purposes for a specific industry?
|
|
1
|
904
|
December 9, 2025
|
|
Regarding the issue of fine-tuning the training of rule-based models, I would like to ask everyone to discuss it togethe
|
|
5
|
68
|
December 8, 2025
|
|
Prompt Injection concern with think tags
|
|
3
|
67
|
December 5, 2025
|
|
Models with high and/or controllable reasoning effort
|
|
2
|
71
|
December 5, 2025
|
|
What is the best text embedding model for ecommerce product search (short, noisy user queries)?
|
|
2
|
65
|
December 5, 2025
|
|
Proposal and Observation: Segmentation-Aware Base Models for Better Quality & Future Commercial Applications
|
|
0
|
18
|
December 1, 2025
|
|
Model Produces Chaotic / Repetitive Output When `top_k` Is Higher — How to Fix This
|
|
6
|
171
|
December 1, 2025
|
|
Run ONNXRUNTIME for insightface Model
|
|
2
|
1865
|
November 27, 2025
|
|
How to Evaluate Fine-Tuned LLMs?
|
|
3
|
87
|
November 27, 2025
|