Fine-tune BERT and Camembert for regression problem

Is the training loss also increasing?

1 Like