How to choose a base model while fine tuning

subhojit777 · August 11, 2023, 12:22pm

Hello. I am new to NLP and am researching how Hugging Face can best be used for a use case in my organization: support ticket categorization. This looks like a text-classification task. I plan to fine-tune a base model on the organization’s dataset.

My question is: how do I choose the base model that best suits my purpose? I have seen in the documentation that, although the task is text-classification, a fill-mask model is used. In that example the task is text-classification, but the model is distilbert-base-uncased, which is a fill-mask model.

So, is it okay to use any NLP base model for training? How does that work?
