Are there any smart loss functions for a sequence of float vectors?

jahb57 · January 7, 2024, 2:39pm

Hi,

I am hoping to train a transformer to predict a sequence of vector embeddings.

I have a target sequence and was thinking of doing something like Avg cosine distance or MSE of all vectors in the sequence. I haven’t seen these distance metrics being used like this before so thought I would post here if anyone has recommendation on better loss functions.

I am not sure if I should be doing something more thought out. For example averaging the cosine distance then would mean the loss function does not account for the order of the embeddings. Though since the target sequence is a sequence of Transformer embeddings which already gone through positional encoding at some point I also wonder if it is necessary.

Topic		Replies	Views
Sentence transformers - SoftmaxLoss Models	1	968	June 20, 2024
How to use RMSE loss for regression Beginners	0	146	January 24, 2024
Why does TripletMargin Loss function default is euclidean? What are the advantages? In regard to cosine similarity Beginners	0	331	October 13, 2023
Triplet (contrastive) loss for sequence embedding 🤗Transformers	0	2367	September 24, 2020
Weighed Loss Function in Regression Task Intermediate	1	625	April 6, 2024

Are there any smart loss functions for a sequence of float vectors?

Related topics