How to train a model on multiple datasets

Altabus · September 18, 2023, 2:25pm

I was able to build a translation model by following the manual, but the quality is not very good. I wanted to train it on several datasets, but I found that they do not build on each other; each one overwrites the model instead. How can I train it on several datasets?

And in general, did I decide to solve this problem correctly?

dblakely · September 18, 2023, 3:41pm

The Hugging Face Datasets library has an interleave_datasets function you could check out; it combines several datasets into one.

"And in general, did I decide to solve this problem correctly?"

Using more data may well help, but it's hard to say without more context. Many things can make a model good or bad.
