Force word embeddings for a specific language with facebook/m2m100_418M

Ravishing3826 · April 19, 2023, 8:36am

I want to get embeddings from Facebook/m2m100_418M for a specific language.
Let’s say I have a French sentence and the same one manually translated into English. I get the input embeddings for the established direction French → English. I calculate the cosine similarity of inputs and get a very high score.

What If someone forgot to translate the text? The English text was in French not in English. Would it be possible to get a mapping of English text to French space that would result in a very low similarity score and alert the user?

Topic		Replies	Views
Arabic to French Word embedding Using skip-gram needs new Ideas in the data part Intermediate	0	31	April 23, 2025
Multilingual multiple languages fine-tuning on facebook mms model Models	2	670	July 17, 2024
Facebook mbart multilingual translation Beginners	0	499	February 1, 2023
Inference for facebook/mbart-large-cc25 Beginners	0	322	May 4, 2022
How to use embeddings to compute similarity? Beginners	4	4430	January 27, 2022

Force word embeddings for a specific language with facebook/m2m100_418M

Related topics