Transformers for small datasets?

Many transformer models, such as BERT and GPT, perform well on large datasets, but what about fine-tuning them on smaller, highly specialized datasets? How can one determine the optimal learning rate or batch size in such cases?
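One common starting point is a small grid search over learning rates and batch sizes, scored on a held-out validation split. Below is a minimal sketch of that idea in NumPy; it uses a toy logistic-regression model and synthetic data as stand-ins for a real transformer and dataset (all names, sizes, and grid values here are illustrative assumptions, not anything from this thread):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for a "small, specialized dataset": 200 examples, 10 features.
X = rng.normal(size=(200, 10))
true_w = rng.normal(size=10)
y = (X @ true_w + 0.1 * rng.normal(size=200) > 0).astype(float)

# Hold out a validation split before searching over hyperparameters.
X_tr, y_tr, X_val, y_val = X[:150], y[:150], X[150:], y[150:]

def train_eval(lr, batch_size, epochs=30):
    """Minibatch SGD on logistic regression; returns validation accuracy."""
    w = np.zeros(X.shape[1])
    n = len(X_tr)
    for _ in range(epochs):
        order = rng.permutation(n)
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X_tr[idx], y_tr[idx]
            p = 1.0 / (1.0 + np.exp(-(Xb @ w)))   # sigmoid predictions
            w -= lr * Xb.T @ (p - yb) / len(idx)  # gradient step
    val_pred = (X_val @ w > 0).astype(float)
    return (val_pred == y_val).mean()

# Small grid over learning rates and batch sizes.
results = {
    (lr, bs): train_eval(lr, bs)
    for lr in (1e-3, 1e-2, 1e-1)
    for bs in (8, 16, 32)
}
best = max(results, key=results.get)
print("best (lr, batch_size):", best, "val acc:", round(results[best], 3))
```

With a real transformer the same loop structure applies, just with the model's own training step in place of the SGD update; for small datasets, smaller learning rates (e.g. in the 1e-5 to 5e-5 range typically used for BERT-style fine-tuning) and early stopping on the validation metric are the usual levers.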


Following this thread with curiosity; great question.

Is there anyone here who can offer guidance on this?

I’m here, but it’s too technical for me to answer.
It’s the kind of subject someone might write a paper or an article on.