Avoid saving deepspeed optimizer and model states at checkpoints

It looks really hard…

1 Like