Tutorial: Fine-tuning with custom datasets – sentiment, NER, and question answering

sgugger · August 19, 2020, 12:37pm

For 1, you can look in the training tutorial where there is an example in PyTorch.
For 2, the head is initialized randomly because we are using a checkpoint of the base model. It would be pretrained if we used a checkpoint that has already been fine-tuned for sequence classification, such as distilbert-base-uncased-finetuned-sst-2-english.
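
To make point 2 concrete, here is a rough sketch of how you could check which weights belong to the head. It assumes the base checkpoint in question is distilbert-base-uncased (the thread only names the fine-tuned one), and the helper names `load_classifier` and `head_parameter_names` are made up for illustration, not part of the library.

```python
# Assumed checkpoint names: the base one is a guess at what the tutorial uses,
# the fine-tuned one is the checkpoint mentioned in the reply above.
BASE_CHECKPOINT = "distilbert-base-uncased"
FINETUNED_CHECKPOINT = "distilbert-base-uncased-finetuned-sst-2-english"


def load_classifier(checkpoint: str, num_labels: int = 2):
    """Load a sequence-classification model from a Hub checkpoint (hypothetical helper)."""
    # Imported lazily so this file can be imported without transformers installed.
    from transformers import AutoModelForSequenceClassification

    return AutoModelForSequenceClassification.from_pretrained(
        checkpoint, num_labels=num_labels
    )


def head_parameter_names(model) -> list[str]:
    """Return the names of parameters that sit outside the base encoder.

    Everything under `model.base_model_prefix` is the pretrained body;
    whatever is left over is the task-specific head.
    """
    prefix = model.base_model_prefix + "."
    return [
        name for name, _ in model.named_parameters() if not name.startswith(prefix)
    ]
```

Calling `head_parameter_names(load_classifier(BASE_CHECKPOINT))` lists the head weights that have no counterpart in the base checkpoint, so they start from a random initialization and the library normally logs a warning saying so. Loading `FINETUNED_CHECKPOINT` the same way should fill those same parameters from the checkpoint instead.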
