Any docs / experiment analysis?
ccc: How can I using deepspeed along with LION optimizer/.? · Issue #23987 · huggingface/transformers (github.com)