Fine-tuning "reasoning" models
|
|
1
|
1123
|
January 23, 2025
|
Best model for extracting price from a page
|
|
2
|
54
|
January 23, 2025
|
AI Accuracy Issues When Analyzing Financial Reports: Seeking Solutions for Persistent Hallucinations
|
|
3
|
38
|
April 11, 2025
|
All-MiniLM-L12-v2 is only for EN?
|
|
2
|
32
|
February 26, 2025
|
Access to 'Google's Gemma models family' gated repo taking too long
|
|
2
|
118
|
January 9, 2025
|
Looking for an AI/ML Partner
|
|
0
|
55
|
February 25, 2025
|
SmolVLM 8bit Quantization Problem
|
|
3
|
446
|
November 29, 2024
|
How can I read PDFs with mPLUG/DocOwl2?
|
|
3
|
38
|
February 6, 2025
|
Generate Mock but realistic data using NLP
|
|
5
|
110
|
September 29, 2024
|
Best Model for sorting a dataset based on a user question
|
|
2
|
52
|
January 30, 2025
|
Qwen 2.5 coder 7b can't use correct separators
|
|
1
|
98
|
December 16, 2024
|
Downloads not being tracked
|
|
3
|
39
|
April 8, 2025
|
Analyze the fine tuning result
|
|
2
|
32
|
February 18, 2025
|
Qwen model corrupt output stream
|
|
3
|
90
|
January 6, 2025
|
Google Gemma2 access request still pending
|
|
1
|
103
|
January 4, 2025
|
Unexpected keyword argument 'negative_prompt' or 'target_size'
|
|
3
|
29
|
February 26, 2025
|
How can I detect the tone of text?
|
|
2
|
112
|
January 3, 2025
|
Unable to run quantized Llama2 70b model
|
|
2
|
91
|
December 30, 2024
|
Gemma 2 access pending
|
|
5
|
159
|
January 9, 2025
|
Finetuning llama-vision-3.2-instruct
|
|
2
|
414
|
November 11, 2024
|
How to train FLUX.1 for custom emoji generation — dataset size, script, and deployment?
|
|
1
|
37
|
April 8, 2025
|
HfHubHTTPError: 401 Client Error
|
|
6
|
4143
|
June 21, 2024
|
When I use convert_lora_safetensor_to_diffusers.py to convert lora trained weights to diffusers format, I got errors:AttributeError: 'UNet2DConditionModel' object has no attribute 'unet'
|
|
3
|
133
|
January 1, 2025
|
Cost of Tax receipt recognition OCR vs. LLM
|
|
2
|
164
|
March 22, 2025
|
Inquiry About 120s Timeout on Hugging Face Inference Endpoint for Llama 3.1-8B
|
|
1
|
24
|
March 28, 2025
|
ASR model of ai4Bharat
|
|
1
|
40
|
December 25, 2024
|
CUDA convert GUFF to CUDA GUFF
|
|
6
|
130
|
December 18, 2024
|
Which Model For Resumes?
|
|
2
|
236
|
March 13, 2025
|
ValueError: Unsupported model type mllama
|
|
3
|
397
|
October 23, 2024
|
Multi-Head Attention in Transformers
|
|
2
|
125
|
January 12, 2025
|