NER: Data augmentation by replacing some words with random string?

FantasticAI · August 18, 2021, 4:13am

Hi

I’m fine-tuning a BERT model to solve an NER problem, using a labeled dataset I have been given. The results are not as good as I expected. If I replace some words with random strings and then fine-tune BERT on the result, will the model learn to rely more on the context and less on the word embeddings? My reasoning is that the random strings will not be in the pretrained model’s vocabulary.
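To make the idea concrete, here is a minimal sketch of the augmentation I have in mind. The function names (`random_string`, `augment`), the BIO-style labels, and the parameters `prob` and `entities_only` are all made up for illustration; nothing here comes from an existing library. It works on word-level tokens, so the labels would still need to be re-aligned after tokenization, since a WordPiece tokenizer will typically split a random string into several subword pieces.

```python
import random
import string


def random_string(rng, min_len=4, max_len=10):
    """Return a random lowercase string (hypothetical helper)."""
    length = rng.randint(min_len, max_len)
    return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))


def augment(tokens, labels, prob=0.3, entities_only=True, seed=None):
    """Replace some tokens with random strings; labels are kept unchanged.

    Assumption: `labels` are word-level tags where "O" marks non-entity words.
    With entities_only=True, only entity words are candidates for replacement.
    """
    rng = random.Random(seed)
    new_tokens = []
    for token, label in zip(tokens, labels):
        is_candidate = (label != "O") if entities_only else True
        if is_candidate and rng.random() < prob:
            new_tokens.append(random_string(rng))
        else:
            new_tokens.append(token)
    return new_tokens, list(labels)


tokens = ["John", "lives", "in", "Paris"]
labels = ["B-PER", "O", "O", "B-LOC"]
aug_tokens, aug_labels = augment(tokens, labels, prob=1.0, seed=0)
print(aug_tokens, aug_labels)
```

With `prob=1.0` every entity word is replaced while the surrounding context words stay as they are, so the only remaining signal for the label is the context.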

Thanks
