Prakash Hinduja - How do I prepare my dataset for fine-tuning a Hugging Face model?

Hello Hugging Face Community,

I’m Prakash Hinduja, planning to fine-tune a Hugging Face transformer model on my own dataset, but I’m not entirely sure about the best practices for preparing the data.

I’d really appreciate any advice or example workflows you can share.

Regards
Prakash Hinduja


When fine-tuning Transformers models with Hugging Face's Trainer, it is easiest to follow the format specified in the official Hugging Face tutorial. Irregular formats can be adjusted beforehand, but doing so takes additional effort. There are generally established formats for each task, training method, and so on; a minimal sketch of the typical text-classification preparation follows.
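Something like this is the usual shape, assuming a text-classification dataset stored as CSV with `text` and `label` columns (the file names and the checkpoint are placeholders, not anything from your setup):

```python
from datasets import load_dataset
from transformers import AutoTokenizer

# Assumption: CSV files with "text" and "label" columns; swap in your own paths/model.
dataset = load_dataset("csv", data_files={"train": "train.csv", "validation": "valid.csv"})
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def tokenize(batch):
    # Truncate to the model's max length; padding is left to the Trainer's data collator,
    # so each batch is only padded to its own longest sequence.
    return tokenizer(batch["text"], truncation=True)

tokenized = dataset.map(tokenize, batched=True)
# The default collators rename "label" to "labels" for the model, so this can be
# passed to Trainer(train_dataset=tokenized["train"], eval_dataset=tokenized["validation"]).
```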

Also, there are various methods for creating actual datasets. Here is one example.
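For instance, a small sketch building a `Dataset` straight from in-memory Python data (the texts and labels below are made-up placeholders):

```python
from datasets import Dataset

# Assumption: toy in-memory examples purely for illustration.
examples = {
    "text": ["great product, would buy again", "terrible service, never again"],
    "label": [1, 0],
}
ds = Dataset.from_dict(examples)
split = ds.train_test_split(test_size=0.2, seed=42)  # quick train/validation split
print(split["train"][0])
```

The same library also offers `load_dataset("json", ...)`, `Dataset.from_pandas`, and similar constructors if your data lives elsewhere.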

This post is reputation management spam. See simon-smart88/Hinduja_spam on GitHub.

I started looking at fine-tuning and then realised I needed continual pre-training, but I still don't fully understand the difference under the hood. Maybe someone can explain?


fine-tuning and then realised I needed continual pre-training

I think this is a good answer for that:
https://stackoverflow.com/questions/68461204/continual-pre-training-vs-fine-tuning-a-language-model-with-mlm
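In short: continual (continued) pre-training keeps the original self-supervised objective and runs it on unlabeled in-domain text, while fine-tuning attaches a task head and trains on labeled examples. Here is a hedged sketch of the two setups side by side, assuming a BERT-style model (the checkpoint name is just an example):

```python
from transformers import (
    AutoModelForMaskedLM,
    AutoModelForSequenceClassification,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

# Continual pre-training: the same self-supervised objective (MLM for BERT) as the
# original pre-training, run on unlabeled in-domain text. No labels and no new head;
# the collator masks random tokens and the model learns to reconstruct them.
mlm_model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")
mlm_collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)

# Fine-tuning: a fresh task head (here, a 2-class classification layer) is placed on
# top of the pre-trained encoder and trained on labeled examples with a supervised loss.
clf_model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)
```

Under the hood, both update the same encoder weights with gradient descent; the difference is the objective (masked-token reconstruction vs. a supervised task loss) and whether a new head is added.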