How to use Seq2seq Trainer with my original "[MASK]"

yusukemori · October 22, 2020, 3:31am

Hello,

My account name is yusukemori.

(I’m who asked about Seq2seq Trainer at https://github.com/huggingface/transformers/issues/7740 .
Thank you for giving me a quick, kind, and detailed reply then!)

This is my first time posting on this forum and I apologize if I’m being rude.

I’m now trying to find how to use my original “[mask]” for pre-training/fine-tuning the Seq2seq model.

I’m wondering how to randomly assign [mask] at training time (per epoch) and how to get Seq2seq Trainer to load it.

As far as BERT is concerned, there seems to be a BERTProcessor in huggingface/tokenizers, which is not intended to be used with the Seq2seq Trainer, if I understand correctly.
Is there any processer, such as “BARTProcessor”, for the Seq2seq Trainer?

Please allow me to ask one more question, is there a code for pre-training (from scratch) BART with the Seq2seq Trainer? (For fine-tuning, thanks to the clear example!)

I’m afraid this is a beginner’s, rude question.
Thank you for your help.

Sincerely,
yusukemori

valhalla · October 22, 2020, 2:57pm

Hi @yusukemori

The Seq2SeqTrainer examples doesn’t (yet!) support pre-training, however you could use Seq2SeqTrainer with your custom data processing logic(masking etc) as it’s a generic s2s trainer.
But you’ll need to implement masking yourself.

yusukemori · October 22, 2020, 4:48pm

Hi @valhalla

Thank you for your reply!

Thanks to your help, I’ve learned the following:

Seq2SeqTrainer examples don’t support pre-training now.
I can use Seq2SeqTrainer with my custom data processing logic, but I have to implement it by myself.

I will try to implement it!

I would like to thank you again for your quick advice.

yusukemori

Topic		Replies	Views
SpanBERT, ELECTRA, MARGE from scratch? Beginners	5	1380	July 22, 2023
Using Seq2SeqTrainer for decoders? 🤗Transformers	0	86	December 25, 2024
Trainer vs seq2seqtrainer 🤗Transformers	4	15081	November 15, 2024
Further Pretrain Basic BERT for sequence classification 🤗Transformers	4	1805	October 9, 2020
Some unintended things happen in Seq2SeqTrainer example 🤗Transformers	3	1583	November 30, 2020

How to use Seq2seq Trainer with my original "[MASK]"

Related topics