Feature extraction for regression/classification vs fine-tuning

This is sort of a general question, but I’ve been fine-tuning some models on a regression task using GPU instances on AWS, and I can already see that the cost is going to be rather astronomical. So I’m wondering whether, as a (hopefully) cheaper option, I should just extract the pooled output and run plain old regression models on a distributed architecture like Spark. Does anyone have experience comparing these two options in terms of cost and performance? Thanks!

Hi @thecity2, your dataset must be huge if you’re considering running Spark jobs on the model outputs :exploding_head: .

I’ve never done this exact comparison (Spark vs GPU), but can’t you get a rough estimate by running the fine-tuning vs feature-extraction comparison on a subset of the dataset? That would also tell you whether the accuracy (or whatever metric you’re measuring) is good enough with the feature-based approach - in some cases, I’ve seen massive drops compared to fine-tuning.
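For the feature-based route, a minimal sketch of the second stage might look like the following. The random vectors here are just a stand-in for the model’s pooled outputs (the commented-out `transformers` lines show one common way to get them, assuming a model that exposes `pooler_output`); the sizes and `Ridge` regressor are illustrative assumptions, not a recommendation.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split

# In practice the features would come from the pretrained model, e.g.:
#   from transformers import AutoTokenizer, AutoModel
#   tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
#   model = AutoModel.from_pretrained("bert-base-uncased")
#   batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
#   features = model(**batch).pooler_output.detach().numpy()
# Here we use random vectors so the sketch runs without a GPU or download.
rng = np.random.default_rng(0)
n_samples, hidden_size = 500, 64  # real pooled outputs are often 768-dim
features = rng.normal(size=(n_samples, hidden_size))

# Synthetic regression target: a linear function of the features plus noise.
true_weights = rng.normal(size=hidden_size)
targets = features @ true_weights + rng.normal(scale=0.1, size=n_samples)

# Plain old (cheap, CPU-friendly) regression on the frozen features.
X_train, X_test, y_train, y_test = train_test_split(
    features, targets, test_size=0.2, random_state=0
)
reg = Ridge(alpha=1.0).fit(X_train, y_train)
print(f"held-out R^2: {r2_score(y_test, reg.predict(X_test)):.3f}")
```

Since the features are computed once and frozen, this second stage is embarrassingly parallel, which is what makes the Spark (or any CPU cluster) option attractive compared to keeping GPUs up for fine-tuning.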
