GPT-2 text generation, structure of evaluation set for compute_metrics

mfedrau · September 28, 2022, 2:20pm

Hello everyone,

I'm currently reproducing the second task (generating articles from a headline) of this tutorial: Text generation with GPT-2 - Model Differently
I understand that the ‘input_ids’ of the training data must be prepared in the format ‘bos_token title sep_token content eos_token’. Now I want to add a compute_metrics function that the trainer calls to evaluate on a separate set, where the model has to predict the ‘content’ given only the ‘title’. How do I prepare the data for the evaluation set?
Is it just ‘bos_token title sep_token’? Or does one have to manipulate the ‘attention_mask’, as indicated here:
GPT2 for QA Pair Generation - #9 by valhalla?
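
To make the question concrete, here is a minimal sketch of the two layouts I have in mind. The special-token strings and the helper names are placeholders of my own, not taken from the tutorial, and I'm assuming the title sits between the bos and sep tokens and the content between the sep and eos tokens. Whether the evaluation prompt shown here is actually the right input is exactly what I'm unsure about.

```python
# Placeholder special tokens (assumed; the tutorial's actual strings may differ).
BOS, SEP, EOS = "<|bos|>", "<|sep|>", "<|eos|>"


def build_training_text(title, content):
    """Full sequence used for fine-tuning: title and content together."""
    return f"{BOS}{title}{SEP}{content}{EOS}"


def build_eval_prompt(title):
    """Candidate evaluation input: ends at the separator so the model
    would have to generate the content itself."""
    return f"{BOS}{title}{SEP}"


def extract_content(decoded_text):
    """Pull the text between the separator and the end token out of a
    decoded sequence, e.g. to compare it with the reference content."""
    after_sep = decoded_text.split(SEP, 1)[1]
    return after_sep.split(EOS, 1)[0].strip()


# Hypothetical record, only for illustration.
example = {"title": "Some headline", "content": "Some article body."}

train_text = build_training_text(example["title"], example["content"])
eval_prompt = build_eval_prompt(example["title"])
```

With this layout the evaluation prompt is a strict prefix of the training text, so a metric could compare `extract_content(generated)` against `example["content"]`.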
