Top performer for image classification

joeostr · June 4, 2025, 9:07pm

As of today, which model is the absolute best and most accurate for fine-tuning with a custom dataset for NSFW image classification across a few labels?

John6666 · June 5, 2025, 6:07am

If detection is all that is required, ViT may be sufficient. If detailed information needs to be extracted, an approach using a multimodal model such as JoyCaption could also be considered.

joeostr · June 5, 2025, 2:48pm

@John6666, i dont need extraction. I dont even need detection to identify parts, i just need a super accurate label for the image as a whole. Im looking for 98% accuracy. I have about 40k images per label. Which ViT model would you recommend?

John6666 · June 6, 2025, 10:38am

If precision is required… 98% should be just about okay…?

Topic		Replies	Views
OCR Confidence score extraction for OpenGVLab/InternVL2_5-8B-MPO Models	2	89	February 6, 2025
Need help in determining model quality Beginners	32	123	January 2, 2025
How to Train an Image Captioning Model for specific language Beginners	3	20	March 9, 2025
Extracting metadata from images using LLMs Beginners	2	32	June 18, 2025
Incremental learning for image captioning 🤗Transformers	3	85	October 1, 2024

Top performer for image classification

Related topics