Running blenderbot-3B locally does not produce the same results as the Inference API

Eichhof · March 25, 2022, 6:03pm

Hello

I tried the facebook/blenderbot-3B model with the Hosted Inference API and it works pretty well (facebook/blenderbot-3B · Hugging Face). I then tried to run it locally with the Python script shown below. The generated responses are much worse than those from the Inference API and do not make sense most of the time.

Is different code used for the Inference API, or did I make a mistake?

from transformers import BlenderbotTokenizer, BlenderbotForConditionalGeneration
import torch

device = "cuda:0" if torch.cuda.is_available() else "cpu"
chat_bots = {
    'BlenderBot': [BlenderbotTokenizer.from_pretrained("hyunwoongko/blenderbot-9B"), BlenderbotForConditionalGeneration.from_pretrained("hyunwoongko/blenderbot-9B").to(device)],
}
key = 'BlenderBot'
tokenizer, model = chat_bots[key]

for step in range(100):
    new_user_input_ids = tokenizer.encode(input(">> User:") + tokenizer.eos_token, return_tensors='pt').to(device)
    if step > 0:
        bot_input_ids = torch.cat([chat_history_ids, new_user_input_ids], dim=-1)
    else:
        bot_input_ids = new_user_input_ids

    chat_history_ids = model.generate(bot_input_ids, max_length=1000, pad_token_id=tokenizer.eos_token_id).to(device)

    print("Bot: ", tokenizer.batch_decode(chat_history_ids, skip_special_tokens=True)[0])

Eichhof · March 28, 2022, 1:20pm

Does anyone have an idea? Any help is appreciated.

merve · March 28, 2022, 1:47pm

Hello

My guess is that your parameters might differ from the default ones used in the inference widget.
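
To make that guess concrete, here is a minimal sketch of a local chat loop that leaves the generation parameters at whatever the checkpoint ships with, instead of overriding them as the script above does with max_length=1000. It also rebuilds the encoder input from the text of all turns on every step, because BlenderBot is an encoder-decoder model and generate() returns only the reply tokens, not the history followed by the reply. The helper names (build_prompt, truncate_left, chat), the two-space turn separator, and the idea that the checkpoint defaults are close to what the widget uses are assumptions for illustration, not confirmed details of the Inference API.

```python
from typing import List

# Assumption: turns are joined with two spaces. This separator is a guess
# for illustration, not a documented Inference API detail.
TURN_SEPARATOR = "  "


def build_prompt(turns: List[str], separator: str = TURN_SEPARATOR) -> str:
    """Join all dialogue turns (oldest first) into a single encoder input."""
    return separator.join(turn.strip() for turn in turns)


def truncate_left(token_ids: List[int], max_tokens: int) -> List[int]:
    """Drop the oldest tokens so that at most max_tokens remain."""
    return token_ids[max(0, len(token_ids) - max_tokens):]


def chat(model_name: str = "facebook/blenderbot-3B") -> None:
    """Hypothetical chat loop; an empty user line ends the conversation."""
    # Imported here so the helpers above work without transformers installed.
    import torch
    from transformers import (
        BlenderbotForConditionalGeneration,
        BlenderbotTokenizer,
    )

    device = "cuda:0" if torch.cuda.is_available() else "cpu"
    tokenizer = BlenderbotTokenizer.from_pretrained(model_name)
    model = BlenderbotForConditionalGeneration.from_pretrained(model_name).to(device)

    turns: List[str] = []
    while True:
        user_text = input(">> User: ")
        if not user_text:
            break
        turns.append(user_text)

        # Encode the whole dialogue as text, then keep only the newest tokens
        # so the input fits the length limit reported by the tokenizer.
        ids = tokenizer(build_prompt(turns))["input_ids"]
        ids = truncate_left(ids, tokenizer.model_max_length)
        input_ids = torch.tensor([ids], device=device)

        # No max_length, num_beams, etc. are passed here, so generate()
        # falls back to the generation settings stored with the checkpoint.
        reply_ids = model.generate(input_ids=input_ids)
        reply = tokenizer.decode(reply_ids[0], skip_special_tokens=True).strip()

        turns.append(reply)
        print("Bot:", reply)
```

Calling chat() starts the loop. If the replies still differ from the widget, a reasonable next check is to compare the generation settings stored with the checkpoint (for example model.generation_config on recent transformers versions) against any values passed to generate().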
