loss vs. train_loss is at the bottom of this: [trainer] 'train_loss' different from 'loss'
flos is here (kinda): "total_flos" showing much bigger number than expected · Issue #15006 · huggingface/transformers · GitHub
loss vs. train_loss is at the bottom of this: [trainer] 'train_loss' different from 'loss'
flos is here (kinda): "total_flos" showing much bigger number than expected · Issue #15006 · huggingface/transformers · GitHub