Models

Topic	Replies	Views	Activity
Fine-tuning "reasoning" models	1	1123	January 23, 2025
Best model for extracting price from a page	2	54	January 23, 2025
AI Accuracy Issues When Analyzing Financial Reports: Seeking Solutions for Persistent Hallucinations	3	38	April 11, 2025
All-MiniLM-L12-v2 is only for EN?	2	32	February 26, 2025
Access to 'Google's Gemma models family' gated repo taking too long	2	118	January 9, 2025
Looking for an AI/ML Partner	0	55	February 25, 2025
SmolVLM 8bit Quantization Problem	3	446	November 29, 2024
How can I read PDFs with mPLUG/DocOwl2?	3	38	February 6, 2025
Generate Mock but realistic data using NLP	5	110	September 29, 2024
Best Model for sorting a dataset based on a user question	2	52	January 30, 2025
Qwen 2.5 coder 7b can't use correct separators	1	98	December 16, 2024
Downloads not being tracked	3	39	April 8, 2025
Analyze the fine tuning result	2	32	February 18, 2025
Qwen model corrupt output stream	3	90	January 6, 2025
Google Gemma2 access request still pending	1	103	January 4, 2025
Unexpected keyword argument 'negative_prompt' or 'target_size'	3	29	February 26, 2025
How can I detect the tone of text?	2	112	January 3, 2025
Unable to run quantized Llama2 70b model	2	91	December 30, 2024
Gemma 2 access pending	5	159	January 9, 2025
Finetuning llama-vision-3.2-instruct	2	414	November 11, 2024
How to train FLUX.1 for custom emoji generation — dataset size, script, and deployment?	1	37	April 8, 2025
HfHubHTTPError: 401 Client Error	6	4143	June 21, 2024
When I use convert_lora_safetensor_to_diffusers.py to convert lora trained weights to diffusers format， I got errors：AttributeError: 'UNet2DConditionModel' object has no attribute 'unet'	3	133	January 1, 2025
Cost of Tax receipt recognition OCR vs. LLM	2	164	March 22, 2025
Inquiry About 120s Timeout on Hugging Face Inference Endpoint for Llama 3.1-8B	1	24	March 28, 2025
ASR model of ai4Bharat	1	40	December 25, 2024
CUDA convert GUFF to CUDA GUFF	6	130	December 18, 2024
Which Model For Resumes?	2	236	March 13, 2025
ValueError: Unsupported model type mllama	3	397	October 23, 2024
Multi-Head Attention in Transformers	2	125	January 12, 2025