Multi-turn dialogue using dialoGPT with Hosted Inference API

anthonyralston · July 27, 2020, 10:22am

I am looking to use DialoGPT-large on the Hosted Inference API for a chatbot demo, but I am having trouble generating decent multi-turn dialogue.

As an example, when I post the following to the API endpoint:

I heard you won the cricket match. <|endoftext|> I did! <|endoftext|> Awesome. Who did you play against? <|endoftext|> I played against the Aussies. <|endoftext|> Wow ! Was it a tough game? <|endoftext|> It was a tough game. It went on till the last over. They almost won. <|endoftext|> Where was the match? <|endoftext|>

It seems to just spit it back out at me:

I heard you won the cricket match. <|endoftext|> I did! <|endoftext|> Awesome. Who did you play against? <|endoftext|> I played against the Aussies. <|endoftext|> Wow ! Was it a tough game? <|endoftext|> It was a tough game. It went on till the last over. They almost won. <|endoftext|> Where was the match? <|endoftext|>

This blog post has an example of someone getting meaningful results from exactly the above prompt: "A simple contextual chatbot to predict a reply with pre-trained DialoGPT model from Huggingface" by Ramsri Goutham (DataDrivenInvestor).

Any guidance as to where I’m going wrong would be really appreciated.

bhargavsdesai · July 30, 2020, 5:30am

Try without the spaces. Works for me.

anthonyralston · July 30, 2020, 10:07pm

Hmm, gave that a go but no luck. Assuming you mean no spaces between the end of text tokens and the text itself?

valhalla · July 31, 2020, 5:04am

Hi @anthonyralston, from the DialoGPT paper, section 3.1:

"We first concatenate all dialog turns within a dialogue session into a long text x_1, ..., x_N (N is the sequence length), ended by the end-of-text token."

So I think you won’t need to add eos tokens. Just concatenate your history and feed it to the model. That is how the model is trained.
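
A minimal sketch of the two prompt formats suggested in this thread, assuming the dialogue history is kept as a list of strings. The `build_prompt` helper and its `separator` and `end_token` parameters are hypothetical names used for illustration; they are not part of the Inference API or of any library, and this does not confirm which format the hosted model handles best.

```python
from typing import List

# End-of-text token as it appears in the prompt from the original post.
EOS_TOKEN = "<|endoftext|>"


def build_prompt(turns: List[str], separator: str = " ", end_token: str = "") -> str:
    """Join dialogue turns into a single input string.

    Each turn is stripped of surrounding whitespace and empty turns are
    dropped, so no stray spaces end up next to the separator.
    """
    cleaned = [turn.strip() for turn in turns if turn.strip()]
    return separator.join(cleaned) + end_token


history = [
    "I heard you won the cricket match.",
    "I did!",
    "Awesome. Who did you play against?",
    "I played against the Aussies.",
    "Where was the match?",
]

# Variant 1: plain concatenation of the history, with no EOS tokens added.
plain_prompt = build_prompt(history)

# Variant 2: EOS token after every turn, with no spaces around the token.
eos_prompt = build_prompt(history, separator=EOS_TOKEN, end_token=EOS_TOKEN)

print(plain_prompt)
print(eos_prompt)
```

For comparison, the usage example on the DialoGPT model card appends `tokenizer.eos_token` directly after each turn when running the model locally, which corresponds to the second variant. Trying both strings against the endpoint is a quick way to see which one the hosted model continues rather than echoes.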
