I noticed that in most (or all) dialogPT tutorials, when somebody trains on top of it with their own data, the answers they get back always degenerate into things like “!!!?!?!!;,!.com?!”, “!!!”, or “” after about 3-5 questions. I had this problem in my own training code too. Why is that?
In my experience this correlates with:
- Too little fine-tuning data. I don’t know why that is the case, but I have noticed a significant drop in this “!!!?!?!!;,!.com?!” behaviour once you increase the fine-tuning dataset size.
- This seems to occur only on DialoGPT-small. I have not seen it once on the medium version. This is not a big deal, since if you can train DialoGPT-small, you will generally be able to train DialoGPT-medium on the same GPU.
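If you want to catch this degeneration automatically while evaluating your fine-tuned model, a quick heuristic (my own sketch, not anything from DialoGPT itself; the function name and threshold are made up) is to flag replies that are mostly punctuation:

```python
import string

def is_degenerate(reply: str, threshold: float = 0.5) -> bool:
    """Flag replies dominated by punctuation, e.g. '!!!?!?!!;,!.com?!'.

    threshold is the fraction of punctuation characters above which
    a reply is considered degenerate (0.5 is an arbitrary starting point).
    """
    if not reply.strip():
        return True  # treat empty/whitespace-only replies as degenerate
    punct = sum(ch in string.punctuation for ch in reply)
    return punct / len(reply) >= threshold

print(is_degenerate("!!!?!?!!;,!.com?!"))   # True  (~82% punctuation)
print(is_degenerate("Hello, how are you?")) # False (~11% punctuation)
```

You could run this over a batch of sampled replies every few hundred training steps to see whether the collapse is starting, instead of noticing it only by chatting with the model.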
P.S. You had me confused for a second there. It’s not “dialogPT”, it’s DialoGPT, as it’s based on the GPT-2 model.