Pre-Train BERT (from scratch)

Any progress here? It would be so convenient to train a BERT from scratch using datasets and transformers. Has anyone achieved results comparable to the original BERT?

Hi @BramVanroy, is there an example of pretraining BERT on the NSP task with dataset.map? Thanks!
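I haven't seen an official example, but one way to build NSP pairs with dataset.map is a batched function that emits sentence pairs and labels. This is only a sketch under my own assumptions: it assumes a "sentences" column holding one list of sentences per document, and the column names (sentence_a, sentence_b, next_sentence_label) are just my choices, not anything transformers requires.

```python
import random

def make_nsp_pairs(batch, seed=0):
    """Batched map function: turn lists of sentences (one list per
    document) into NSP pairs. Roughly half the pairs use the true next
    sentence (label 0), the rest a randomly drawn sentence (label 1),
    following BERT's convention that 0 means "is next".

    Note: the random branch can occasionally pick the true next
    sentence by chance; the original BERT code has the same quirk."""
    rng = random.Random(seed)
    firsts, seconds, labels = [], [], []
    docs = batch["sentences"]
    for doc in docs:
        for i in range(len(doc) - 1):
            if rng.random() < 0.5:
                firsts.append(doc[i])
                seconds.append(doc[i + 1])
                labels.append(0)
            else:
                firsts.append(doc[i])
                seconds.append(rng.choice(rng.choice(docs)))
                labels.append(1)
    return {"sentence_a": firsts,
            "sentence_b": seconds,
            "next_sentence_label": labels}
```

With a datasets.Dataset it would plug in as `dataset.map(make_nsp_pairs, batched=True, remove_columns=["sentences"])`; with batched=True the function is allowed to return a different number of rows than it received, which is what makes pair generation possible.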

Hi @vblagoje, I found that the file_path param of TextDatasetForNextSentencePrediction accepts only a single file. Does that mean I need to merge all my datasets into one file when splitting sentences? That file would be too big.

To chunk the articles, you can check https://huggingface.co/docs/datasets/processing.html#augmenting-the-dataset

The new link is https://huggingface.co/docs/datasets/process#data-augmentation
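For a concrete version of the chunking the docs describe, here is a minimal sketch, assuming a "text" column and a simple whitespace split (real pipelines would usually chunk by tokenizer tokens instead):

```python
def chunk_articles(batch, chunk_size=128):
    """Batched map function: split each article into chunks of at most
    chunk_size words, so no single example gets too long. Because the
    output can have more rows than the input, this only works with
    batched=True."""
    chunks = []
    for article in batch["text"]:
        words = article.split()
        for i in range(0, len(words), chunk_size):
            chunks.append(" ".join(words[i:i + chunk_size]))
    return {"text": chunks}
```

Applied as `dataset.map(chunk_articles, batched=True, remove_columns=dataset.column_names)`, this keeps everything inside a datasets.Dataset, so there is no need to write all articles into one giant file first.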