Hmm… Maybe this?
https://stackoverflow.com/questions/71581197/what-is-the-loss-function-used-in-trainer-from-the-transformers-library-of-huggi https://stackoverflow.com/questions/72350835/how-to-plot-loss-when-using-hugginfaces-trainer