Outputs change when reusing the KV cache (past_key_values) in model.forward and generation

I ran the code and got a warning, and I wonder whether it is related to the change in outputs.
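
To make the comparison concrete, here is a toy sketch of the check in question: run causal attention over the whole sequence at once, then run it token by token while reusing cached keys and values, and measure the largest difference. This is plain Python and not the actual model code. The function names (`attend`, `forward_full`, `forward_step`) and the use of identity projections (query = key = value = input) are assumptions made only for illustration.

```python
import math


def attend(q, keys, values):
    """Single-head scaled dot-product attention for one query vector."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
    top = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[i] for w, v in zip(weights, values)) / total
            for i in range(dim)]


def forward_full(xs):
    """Causal attention over the whole sequence at once (no cache)."""
    return [attend(xs[t], xs[: t + 1], xs[: t + 1]) for t in range(len(xs))]


def forward_step(x, cache):
    """Process one new token, appending its key and value to the cache."""
    # Assumption: identity projections, so the key and value are the input itself.
    cache["keys"].append(x)
    cache["values"].append(x)
    return attend(x, cache["keys"], cache["values"])


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two lists of vectors."""
    return max(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))


# Hypothetical toy inputs: four tokens with three features each.
xs = [[0.1, -0.2, 0.3], [0.5, 0.4, -0.1], [-0.3, 0.2, 0.7], [0.0, 0.6, -0.5]]

full = forward_full(xs)
cache = {"keys": [], "values": []}
cached = [forward_step(x, cache) for x in xs]
diff = max_abs_diff(full, cached)
print(f"max abs difference: {diff:.3e}")
```

In this toy both paths perform the same arithmetic, so it only shows the shape of the check. With a real model, the same comparison would be made between the logits from a single model.forward call and the logits obtained when past_key_values is passed back in.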