I would like to request adding elastic launch support to the notebook_launcher
function in the Accelerate library. This feature will enable fault-tolerant training directly from Python scripts, similar to the current CLI support. Currently, elastic launch is limited to the CLI. Adding this feature to notebook_launcher
would benefit users who prefer programmatic control within Python scripts, providing more flexibility and robustness. Thank you for considering this request!