Any docs / experiment analysis?