I tried to fine-tune GPT-2,
but after a few iterations I started getting the same sequence every time, for example:
prompt: Prime number is
model output: <|endoftext|><|endoftext|><|endoftext|><|endoftext|><|endoftext|><|endoftext|><|endoftext|><|endoftext|>
By the way, my text is split into sentences without [EOS] tokens; in other words, every item is a separate sentence.
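For clarity, this is the kind of preprocessing I mean — a minimal sketch (assuming GPT-2's standard `<|endoftext|>` token string; the helper name is made up) that appends the EOS token to each sentence before tokenization:

```python
# Sketch: append GPT-2's end-of-text marker to every training sentence
# so the model sees explicit sequence boundaries during fine-tuning.
# "<|endoftext|>" is assumed to be the EOS string, as in the
# Hugging Face GPT-2 tokenizer.
EOS = "<|endoftext|>"

def add_eos(sentences):
    """Append EOS to each sentence that does not already end with it."""
    return [s if s.endswith(EOS) else s + EOS for s in sentences]

dataset = [
    "Prime number is a natural number greater than 1.",
    "It has no positive divisors other than 1 and itself.",
]
print(add_eos(dataset))
```

Right now my items have no such marker at all, which is why I suspect the model may be collapsing onto `<|endoftext|>`.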
Update:
It's reproducible even without fine-tuning.
On the top screenshot the output is correct;
on the bottom it's the EOS sequence,
but the code is generally the same.