Finetuning ALBERT with custom unlabeled dataset for next sentence prediction task

FeryET · September 2, 2020, 7:32am

Hi.

I’m trying to figure out how to make my dataset compatible with ALBERT for next sentence prediction task. How should I generate the next sentence logits? Are there any examples? I have around 1 million paragraphs with around 300 words each, and my dataset is completely unlabelled (but is domain specific).

Topic		Replies	Views
Roberta nsp prediction Beginners	0	356	June 11, 2021
Training ALBERT from scratch with Distributed Training 🤗Transformers	0	1705	September 25, 2020
Request for Further Information on Datasets Beginners	0	280	November 26, 2020
How to fine-tune BERT model for next word prediction? Beginners	0	1113	October 3, 2021
BERT Next Sentence Prediction: How to do predictions? Beginners	5	7546	September 29, 2022

Finetuning ALBERT with custom unlabeled dataset for next sentence prediction task

Related topics