Inflated GPU memory footprint of model prepared via accelerate

@tongyx361 Check this out: Data Parallel Multi GPU Inference
