Also I am able to replicate this in Jupyter notebook . But using SFT Trainer I am able to train .
able to train with 13B model?
Also I am able to replicate this in Jupyter notebook . But using SFT Trainer I am able to train .
able to train with 13B model?