Implementing triplet loss in ViT

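I think subclassing Trainer and overriding its compute_loss method is the most reliable way. The question linked below covers the general pattern for plugging in a custom loss.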

https://stackoverflow.com/questions/66302371/how-to-specify-the-loss-function-when-finetuning-a-model-using-the-huggingface-t
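
For what it's worth, here is a minimal sketch of what that override might look like for a triplet loss. It is not taken from the linked answer. The TripletTrainer name, the "anchor" / "positive" / "negative" batch keys, and the choice of the CLS token as the embedding are all assumptions for illustration, and it presumes a plain ViTModel-style backbone whose output exposes last_hidden_state.

```python
import torch
import torch.nn.functional as F
from transformers import Trainer


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet margin loss on embedding batches of shape (batch, dim)."""
    return F.triplet_margin_loss(anchor, positive, negative, margin=margin)


class TripletTrainer(Trainer):
    # Hypothetical subclass. Assumes every batch is a dict holding three
    # pixel_values tensors under the keys "anchor", "positive", "negative".
    def compute_loss(self, model, inputs, return_outputs=False, **kwargs):
        def embed(pixel_values):
            # Assumption: use the CLS token of the last hidden state
            # as the image embedding.
            return model(pixel_values=pixel_values).last_hidden_state[:, 0]

        anchor = embed(inputs["anchor"])
        positive = embed(inputs["positive"])
        negative = embed(inputs["negative"])

        loss = triplet_loss(anchor, positive, negative)
        outputs = {"anchor": anchor, "positive": positive, "negative": negative}
        return (loss, outputs) if return_outputs else loss
```

To get batches in that shape you would probably need your own data collator, and likely remove_unused_columns=False in TrainingArguments so the Trainer does not drop columns that are not arguments of the model's forward method. The **kwargs in the signature is there because newer transformers versions pass extra arguments to compute_loss.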