@patrickvonplaten @ttj I think this is a good question! Could we discuss on how to do batch inference with past_key_values
?
@patrickvonplaten @ttj I think this is a good question! Could we discuss on how to do batch inference with past_key_values
?