Doing this solved the problem for me, although it still feels like a bug related to GenerationConfig
:
config = transformers.GenerationConfig()
config.eos_token_id = var.tokenizer.eos_token_id
output_ids = var.model.generate(
input_ids,
generation_config = config,
)