I’m fine-tuning GPT-2 XL for 3 epochs, and I’d like to recover exactly which training data the model has seen every 242 steps (my checkpoint interval). I thought of computing the number of rows consumed by multiplying the batch size by the number of steps and slicing the original training dataset (input_ids) accordingly, but I’m guessing the data is shuffled during training, so a slice in the original order might not be the right thing to do. I’d appreciate any help or pointers.
These are my training args:
```python
training_args = TrainingArguments(
    output_dir="models/XL/",
    evaluation_strategy="steps",
    learning_rate=2e-5,
    weight_decay=0.01,
    push_to_hub=False,
    num_train_epochs=3,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    save_strategy="steps",
    save_steps=242,
    fp16=True,
    report_to="none",
    logging_strategy="steps",
    logging_steps=50,
)
```
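For what it’s worth, the slicing idea can work if you can reproduce the shuffle. The Trainer shuffles the training set each epoch with a seeded generator, so in principle you can regenerate the same permutation and slice it. The sketch below illustrates the principle only, using Python’s `random` with a fixed seed; the Trainer’s actual sampler (a seeded `torch.Generator` feeding a `RandomSampler`, which varies across library versions) would need to be mirrored exactly for real use, and the hypothetical helper name `rows_seen` is mine:

```python
import random

def rows_seen(num_rows, batch_size, save_steps, checkpoint_idx, seed=42):
    """Return the dataset row indices consumed between checkpoint
    (checkpoint_idx - 1) and checkpoint_idx, assuming the epoch order
    is a random permutation produced with a known seed.

    NOTE: illustration only -- the HF Trainer's real shuffle uses a
    seeded torch.Generator, not Python's random module.
    """
    # Reconstruct the (assumed) shuffled visitation order for the epoch.
    order = list(range(num_rows))
    random.Random(seed).shuffle(order)
    # Each step consumes one batch, so each checkpoint interval
    # covers save_steps * batch_size rows.
    start = (checkpoint_idx - 1) * save_steps * batch_size
    end = checkpoint_idx * save_steps * batch_size
    return order[start:end]
```

With `per_device_train_batch_size=8` and `save_steps=242`, each checkpoint interval covers 242 × 8 = 1936 rows, and consecutive intervals are disjoint within an epoch since the permutation visits each row once.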