How to separate multiturn dialog context in blenderbot?

LiSY · December 5, 2021, 8:14am

Hello. I want to run a multi-turn dialog with the Blenderbot model. I followed the documentation, and here's my code:

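from transformers import BlenderbotTokenizer, BlenderbotForConditionalGeneration

# The checkpoint name is an assumption; the post does not say which one was used.
model_name = 'facebook/blenderbot-400M-distill'
tokenizer = BlenderbotTokenizer.from_pretrained(model_name)
model = BlenderbotForConditionalGeneration.from_pretrained(model_name)

chat_history = []
uttr_sep = '</s> <s>'  # string placed between consecutive turns

while True:
    UTTR_input = input('User: ')
    chat_history.append(UTTR_input)
    # Join the whole history (user and bot turns) into a single input string
    UTTR = uttr_sep.join(chat_history)
    print('Input: ' + UTTR)

    inputs = tokenizer([UTTR], return_tensors='pt')
    reply_ids = model.generate(**inputs)
    REPLY = tokenizer.batch_decode(reply_ids, skip_special_tokens=True)[0]
    print("Bot:\n" + REPLY)

    # Store the bot reply so it becomes part of the next turn's context
    chat_history.append(REPLY)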
But the model treats the whole context as the user's input, and sometimes </s> <s> shows up in the model's responses.

In this issue, there is some discussion about what to use to separate the turns, including </s> <s>, </sep>, \n, and \t, but no conclusion was reached.
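Since no separator is confirmed, one way to compare the candidates listed above is to make the separator a parameter instead of hard-coding it. The sketch below is only an illustration under that assumption: `build_context`, `strip_markers`, and the `max_turns` argument are hypothetical helper names, not part of transformers, and nothing here says which separator Blenderbot actually expects.

```python
from typing import List, Sequence

# Candidate separators named in the discussion; none is confirmed as correct.
CANDIDATE_SEPARATORS = ["</s> <s>", "</sep>", "\n", "\t"]


def build_context(history: List[str], sep: str, max_turns: int = 0) -> str:
    """Join the most recent dialog turns into one input string.

    max_turns <= 0 keeps the whole history (a convention of this sketch).
    """
    turns = history[-max_turns:] if max_turns > 0 else history
    return sep.join(turn.strip() for turn in turns)


def strip_markers(reply: str, markers: Sequence[str] = ("</s>", "<s>")) -> str:
    """Remove leaked marker strings from a decoded reply and collapse whitespace."""
    for marker in markers:
        reply = reply.replace(marker, " ")
    return " ".join(reply.split())


if __name__ == "__main__":
    history = ["Hello!", "Hi, how are you?", "Fine. Do you like tea?"]
    for sep in CANDIDATE_SEPARATORS:
        # repr() makes whitespace separators such as \n and \t visible
        print(repr(build_context(history, sep, max_turns=2)))
```

In the loop above, `UTTR = build_context(chat_history, uttr_sep)` would replace the `join` call, and `strip_markers(REPLY)` could be applied before the reply is appended to `chat_history`, so that leaked markers do not accumulate in later inputs. This only hides the symptom; it does not make the model distinguish speakers.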
