I am looking to use dialoGPT-large on the Hosted Inference API for a chatbot demo but am having trouble generating decent multi-turn dialogue.
As an example, when I post the following to the API endpoint:
I heard you won the cricket match. <|endoftext|> I did! <|endoftext|> Awesome. Who did you play against? <|endoftext|> I played against the Aussies. <|endoftext|> Wow ! Was it a tough game? <|endoftext|> It was a tough game. It went on till the last over. They almost won. <|endoftext|> Where was the match? <|endoftext|>
It seems to just spit it back out at me:
I heard you won the cricket match. <|endoftext|> I did! <|endoftext|> Awesome. Who did you play against? <|endoftext|> I played against the Aussies. <|endoftext|> Wow ! Was it a tough game? <|endoftext|> It was a tough game. It went on till the last over. They almost won. <|endoftext|> Where was the match? <|endoftext|>
This blog post has an example of someone getting meaningful results from exactly the above prompt: A simple contextual chatbot to predict a reply with pre-trained DialoGPT model from Huggingface | by Ramsri Goutham | DataDrivenInvestor.
Any guidance as to where I’m going wrong would be really appreciated.