I have trained a GPT-2 model on my dataset. When I generate text from it, sometimes it produces output like “�����������������������”, going on and on. Other times it generates normal text at first, but then switches to emitting only “�” characters. Does anybody have a clue why it is doing this?
The problem was that when training on TPUs, the weights of the input embedding layer and the output embedding layer don't get tied automatically. The HuggingFace Trainer API handles this for you, but in a custom training script you have to tie them manually.
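A minimal sketch of the manual fix, assuming the `transformers` library and a GPT-2 model built from a config (the small config values here are arbitrary, just to keep the example lightweight):

```python
from transformers import GPT2Config, GPT2LMHeadModel

# Arbitrary small config for illustration; use your own model/config.
config = GPT2Config(n_layer=2, n_head=2, n_embd=64, vocab_size=100)
model = GPT2LMHeadModel(config)

# In a custom training loop, call tie_weights() yourself so the output
# projection (lm_head) shares the input embedding matrix (transformer.wte).
# The HuggingFace Trainer does this automatically; a hand-written TPU
# training script may not.
model.tie_weights()

# Both layers should now point at the same underlying tensor storage.
assert model.lm_head.weight.data_ptr() == model.transformer.wte.weight.data_ptr()
```

If the weights are left untied, the randomly initialized output projection maps hidden states to essentially arbitrary token ids, which is consistent with the garbage “�” output described above.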