Model.generate use_cache=True generates different results than use_cache=False

This case talks about a problem with his custom implementation. I am facing a problem with the official model.generate function in huggingface transformers.

1 Like