How can I fine tune with my own dataset?

cocoshe · May 3, 2022, 9:46am

I see the the tutorials and there are all dataset using api, so I wanna ask if I want to load a dataset in my computer, and I wanna use Trainer API, should I rename my dataset’s column name?
By the way, I’am confused about the column name of “train_dataset” in Trainer API parameter, how it know which column is my input data and which column is my label?(seems different dataset have different column name because of different tasks(some are classification, some are next sentence prediction), but they are all used as “train_dataset” in Trainer API? Did I miss something important?)
And how can we know the batch_size of Trainer.train()? I only know the Dataloader in pytorch.
Thanks so much if someone could answer my question!(something like blog or tutorial would be better!)

Topic		Replies	Views
Column names of custom dataset for use with trainer Beginners	3	5433	March 31, 2024
"Trainer - a PyTorch optimized training loop" example code Beginners	1	487	November 1, 2022
Load torchtext.data.dataset.Dataset to Trainer Beginners	0	558	October 20, 2020
Type of dataset in Trainer class Beginners	3	2428	July 20, 2020
Problems when using PyTorch Class Dataset in model fineturn Beginners	0	220	July 12, 2023

How can I fine tune with my own dataset?

Related topics