Cost to fine tune large transformer models on the cloud?

mickeymnemonic · November 29, 2021, 4:05am

hi folks

curious if anyone has experience fine tuning RoBERTa for purposes of text classification for sentiment analysis on a dataset of ~1000 sentences on a model like RoBERTa or BERT large?

similarly, any idea how much it would cost to further pretrain the language model first on 1GB of uncompressed text?

thank you,

mick

MarktHart · November 29, 2021, 4:25pm

Didn’t use RoBERTa, did use BERT. Finetuning BERT can be done with google colab in decent time, i.e. is sort of free.

Pretraining I cannot say in advance. 1 GB of text data is a lot. Try 10MB for a few epochs first to make a rough estimation. Results are also not guaranteed to improve

Topic		Replies	Views
Fine tuning Sequence 🤗Transformers	0	209	August 27, 2021
Finetuning cost estimation Languages at Hugging Face	2	2573	October 2, 2023
Finetuning cost estimator formula 🤗Transformers	0	511	October 1, 2023
RoBERT model for Sinhala Language Beginners	0	567	May 25, 2021
Pre-Training From Scratch 🤗Transformers	0	1004	October 6, 2021

Cost to fine tune large transformer models on the cloud?

Related topics