Following the documentation, I was able to add new tokens to a Bert tokenizer: huggingface.co/docs/transformers/…/add_tokens
The new tokens should help Bert on my ModelForSequenceClassification task, but in practice they don't: without the new tokens, the model performed better after a few epochs of training.
I wonder if I'm missing a step here. Should I first go back to an MLM task on my training dataset, so the new token embeddings get learned, and only then fine-tune for classification?