How to separate sequences during finetuning gpt

Kwiebes1995 · December 19, 2020, 1:01pm

Hi!

I want to finetune the GPT model on my own data for causal language modeling. I currently use the script provided in the examples directory on the repository:
https://github.com/huggingface/transformers/tree/master/examples/language-modeling

My question is about the preprocessing of the data. I suppose that I need to indicate somehow what a sequence is. I understand how this can be done for GPT-2 but there does not seem to be a ‘[SEP]’ token for GPT. Would it be sufficient to just add this token to the vocabulary? Or did I miss something?

Topic		Replies	Views
Separation token in GPT for text similarity/question answering Models	2	1470	March 23, 2021
Using GPT-J for custom sequence classification Beginners	0	408	September 14, 2022
Task-specific fine-tuning of GPT2 Research	0	1048	April 22, 2021
How to fine-tune GPT on my own data for text generation Beginners	0	2191	January 17, 2022
How to do sequence fine tuning? Beginners	5	749	July 22, 2020

How to separate sequences during finetuning gpt

Related topics