Should I train Bert line by line?

indiejoseph · September 16, 2023, 10:35am

Hi there, i was wondering should i train my model in line by line manner, what is the advantage of it?
The training time with --line_by_line with train_mlm.py is 2x compare to without it.

deadbod-81 · September 18, 2023, 2:26pm

need some more context on what you are trying to achieve and how you are implementing it?

indiejoseph · September 18, 2023, 3:16pm

I have continue pre-trained a model on a bert base model for a new language without --line_by_line, as a linguistic statistical model it is able to predict a probability of a masked word, but i just curious what it the use case for a model trained with line_by_line? is it for LLM to capture more context?

Topic		Replies	Views
Script run_mlm.py line by line 🤗Transformers	1	676	January 24, 2022
MLM vs CLM, can be exchanged? Models	0	1053	August 21, 2022
Pre-training BERT Models	1	381	May 21, 2024
Bert LM pretraining: training loss goes to 0 at masking probability of 0.999 Beginners	2	2319	October 31, 2020
Create my LLM model Beginners	1	1588	April 1, 2024

Should I train Bert line by line?

Related topics