Pretraining ALBERT

stoddur · April 19, 2021, 6:55am

Hello!

I have a question regarding using the transformers library to pretrain ALBERT. I have been using RoBERTa for some while now which I have pretrained with custom data with run_mlm.py from the examples directory which is fine since RoBERTa only uses MLM loss when pretraining. However, ALBERT adds sentence order prediction (SOP) which is not implemented in run_mlm.py. Are there any examples on implementing SOP which I have overlooked in the transformers library? If not, anyone care to share a code example of how to implement this? If not I would have to dive a bit deeper to implement from scratch but I’m hoping I won’t have to

Thanks!

Edit:
I found a DataCollatorForSOP which appears to solve this task: transformers/data_collator.py at d9c62047a8d75e18d2849d345ab3394875a712ef · huggingface/transformers · GitHub

Would be great to have a seperate file in examples which implements the pretraining for ALBERT.

hdm · February 6, 2022, 1:26pm

May I ask how you used that DataCollator? I’m also trying to pretrain ALBERT but I am also facing these difficulties.

ibraheemmoosa · February 16, 2022, 3:00pm

I have pretrained an ALBERT model last year. You do not need a special DataCollator for SOP. Just use the DataCollatorForLanguageModeling.

Topic		Replies	Views
Are albert-base-v1( and v2) pretrained enough? 🤗Transformers	4	354	October 26, 2021
Cannot find pre-trained SOP head of ALBERT Beginners	0	275	October 22, 2020
Inconsistencies between BERT and RoBERTa: what am I doing wrong? Beginners	0	360	May 11, 2022
Training ALBERT from scratch with Distributed Training 🤗Transformers	0	1705	September 25, 2020
Fine-tuning BERT Model on domain specific language and for classification 🤗Transformers	7	8428	November 14, 2024

Pretraining ALBERT

Related topics