Why aren't new lines generated?


I’m currently trying to fine-tune DistilGPT-2 with PyTorch for a code completion task. My corpus is arranged like the following example:

public class FindCityByIdService {
    private CityRepository cityRepository = ...

My first attempt was to run the following command:

python run_clm.py \
     --model_type=gpt2 \
     --model_name_or_path distilgpt2 \
     --do_train \
     --train_file $TRAIN_FILE \
     --num_train_epochs 100 \
     --output_dir $OUTPUT_DIR \
     --overwrite_output_dir \
     --save_steps 20000 \
     --per_device_train_batch_size 4

After running some generation tests, I realized that the model never predicts `\n` for any given context. I imagine some preprocessing step (or something similar) is missing. What should I do so that `\n` is predicted as expected?
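My suspicion about the preprocessing is this: if the training file is read line by line before tokenization (the hypothetical `corpus` string below just mirrors my example above), the newline characters are stripped out, so the model would never see a `\n` token during training. A minimal sketch of that suspicion:

```python
# Two lines of the corpus, as they appear in the training file
corpus = (
    "public class FindCityByIdService {\n"
    "    private CityRepository cityRepository = ...\n"
)

# If preprocessing splits the file into lines, the "\n" characters are dropped:
lines = corpus.splitlines()
print(lines)

# None of the resulting strings contain "\n", so a model trained on them
# would have no newline tokens to learn from.
assert all("\n" not in line for line in lines)
```

Is this what happens inside `run_clm.py`, and if so, how do I keep the newlines?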