GPT2 for QA Pair Generation

valhalla · October 13, 2020, 4:09pm

This seems correct. One more thing to add: you can calculate the loss only on the question: ... part.

To do this, set the labels to -100 for all tokens before the question: part, so the cross-entropy loss will ignore them.
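A minimal sketch of that masking step, in case it helps. It works on plain lists of token ids; the function name `mask_labels_before_marker`, the `marker_ids` argument, and the token ids in the usage note are all made up for illustration, and in practice `marker_ids` would be whatever your tokenizer produces for the question: marker. The value -100 is the default `ignore_index` of PyTorch's cross-entropy loss.

```python
from typing import List

# -100 is the default ignore_index of torch.nn.CrossEntropyLoss,
# so positions carrying this label do not contribute to the loss.
IGNORE_INDEX = -100


def mask_labels_before_marker(input_ids: List[int], marker_ids: List[int]) -> List[int]:
    """Build labels from input_ids, ignoring everything before the marker.

    Hypothetical helper: `marker_ids` is assumed to be the token ids of the
    "question:" marker as produced by your own tokenizer.
    """
    n, m = len(input_ids), len(marker_ids)
    # Find the first position where the marker token sequence starts.
    for start in range(n - m + 1):
        if input_ids[start:start + m] == marker_ids:
            # Mask the prefix; keep the marker and everything after it.
            return [IGNORE_INDEX] * start + list(input_ids[start:])
    # Marker not found: ignore the whole sequence rather than train on it.
    return [IGNORE_INDEX] * n
```

For example, with made-up ids where `[50, 51]` stands in for the marker, `mask_labels_before_marker([10, 11, 12, 50, 51, 20, 21], [50, 51])` masks the first three positions and leaves the rest unchanged. The resulting list can then be turned into a tensor and passed as `labels` alongside `input_ids`.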

Also, you don’t need to explicitly set arguments such as position_ids and head_mask to None.
They are None by default, so it’s fine to leave them out. That will make the code cleaner.
