Nuance of setting the trainable attribute when using GPT2

from transformers import GPT2LMHeadModel, pipeline

gpt2 = GPT2LMHeadModel.from_pretrained('gpt2', cache_dir="./cache", local_files_only=True)
gpt2.trainable = False
gen_nlp = pipeline("text-generation", model=gpt2, tokenizer=tokenizer_gpt2,
                   device=args.gpu, return_full_text=False)
contents = ds.df_train.sample(10)['content'].tolist()
results_trunk = gen_nlp(contents, max_length=64, do_sample=True, top_p=0.9, top_k=0,
                        repetition_penalty=1.0, num_return_sequences=4,
                        clean_up_tokenization_spaces=True)

I use the off-the-shelf GPT2 model for open-ended generation. I found that there is an attribute,

gpt2.trainable

which can be set to False or True before use.
Does anyone know the nuance of this setting?
What is the best value for this attribute?
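For context, here is a minimal sketch of what that assignment does when the model is the PyTorch GPT2LMHeadModel (a torch.nn.Module, which has no built-in trainable attribute, unlike Keras models). A small nn.Linear stands in for the full GPT2 model to keep the example self-contained:

```python
import torch
from torch import nn

# Stand-in for GPT2LMHeadModel (both are nn.Module subclasses).
model = nn.Linear(4, 2)

# Assigning .trainable just creates a plain Python attribute;
# it does NOT freeze any weights.
model.trainable = False
print(all(p.requires_grad for p in model.parameters()))  # still True

# The idiomatic PyTorch way to freeze parameters:
for p in model.parameters():
    p.requires_grad_(False)
print(any(p.requires_grad for p in model.parameters()))  # now False

# For pure inference, also disable dropout and gradient tracking:
model.eval()
with torch.no_grad():
    out = model(torch.randn(1, 4))
```

If this observation is right, the setting has no effect on a PyTorch model at all; it would only matter for the TensorFlow/Keras variant (TFGPT2LMHeadModel), where trainable is a real Keras attribute.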