I am trying to fine-tune a translation model, but I want to experiment with different tokenizers, which means I will not be using the same tokenizer for both languages. How should I set up the preprocessing function, the data collator, and the Seq2Seq training in that case?
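One way to sketch the preprocessing step (assuming the Hugging Face `transformers`/`datasets`-style APIs, since the question mentions seq2seq training; the `"source"`/`"target"` column names and the `max_length` value are placeholders for illustration) is to tokenize each side with its own tokenizer and put the target ids under the `"labels"` key:

```python
# Minimal sketch, assuming two already-loaded tokenizers (e.g. via
# AutoTokenizer.from_pretrained) and a dataset with "source"/"target"
# columns -- both names are assumptions, not fixed by the question.

def preprocess_function(examples, src_tokenizer, tgt_tokenizer, max_length=128):
    """Tokenize sources and targets with their own tokenizers."""
    # Encoder inputs come from the source-language tokenizer.
    model_inputs = src_tokenizer(
        examples["source"], max_length=max_length, truncation=True
    )
    # Decoder targets come from the target-language tokenizer.
    labels = tgt_tokenizer(
        examples["target"], max_length=max_length, truncation=True
    )
    # Seq2Seq trainers expect the target token ids under "labels".
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs
```

For collation, note that `DataCollatorForSeq2Seq` pads the encoder inputs with the tokenizer you pass it, while labels are padded with `label_pad_token_id` (default `-100`) regardless, so the source tokenizer is the one the collator needs. Also note that most stock seq2seq checkpoints assume a single shared vocabulary; using two tokenizers generally requires a model whose encoder and decoder have separate embeddings (for example, an `EncoderDecoderModel` combining pretrained encoder and decoder checkpoints), and each embedding size must match the corresponding tokenizer's vocabulary size.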