Any documented examples of using DeepSpeed without trainer?

I would like to be able to use DeepSpeed with NLP Transformers but I don’t want to use Trainer. Are there any examples of this available?

You can try with Accelerate instead! DeepSpeed