How to handle translations one source language to many target sentences for the same language

sancelot · November 9, 2023, 3:29pm

Hi,
I am training a m2m100 model with more technical corpus data using IATE database (https://iate.europa.eu/)

I don’t know how to handle a case , when a source text may have different significations for the same sentence.
That means for one source language sentence , I will assign 3 targets language definitions

What would happen ???

by example , I have these data for the same term, I have 3 associated definitions in english language :

line[61163] = ['1448173', 'mechanical engineering', 'en', 'to jam', 'Term', 'Reliable', '', '0', '', '', 'COM', '2014-05-19T14:56:28.001Z']
line[61164] = ['1448173', 'mechanical engineering', 'en', 'to seize', 'Term', 'Reliable', '', '0', '', '', 'COM', '2014-05-19T14:56:28.001Z']
line[61165] = ['1448173', 'mechanical engineering', 'en', 'to get stuck', 'Term', 'Reliable', '', '0', '', '', 'COM', '2014-05-19T14:56:28.001Z']
line[61166] = ['1448173', 'mechanical engineering', 'de', 'festlaufen', 'Term', 'Reliable', '', '0', '', '', 'COM', '2014-05-19T14:56:28.001Z']

Topic		Replies	Views
[Feature Request] Is there an option for multiple target language in translation pipeline? 🤗Transformers	0	276	March 16, 2023
Translation - MBART, translation with identical source and target language, for text normalization 🤗Transformers	3	554	July 14, 2021
Finetuning a model for machine translation on a programming language Models	1	647	November 29, 2023
Matching original and translated words with MarianMT Models	1	1065	May 21, 2021
Language pair with multiple models on the model hub? 🤗Transformers	1	338	August 10, 2020

How to handle translations one source language to many target sentences for the same language

Related topics