Problem with model inference using accelerate

Thank you @muellerzr the problem is solved.