Choosing Benchmarks for Fine-Tuned Models in Emotion Analysis

Hello Hugging Face community,

I’m working on my master’s thesis, and I need your advice regarding the best way to validate my chosen models. My thesis focuses on emotion analysis in text (e.g., polarity such as positive/negative, or more fine-grained emotion categories). I’ve narrowed down my choices to five fine-tuned models from Hugging Face, but I’m struggling to select 3–4 benchmarks on which to evaluate them.

Here’s my situation:

  1. Some of the models don’t have clearly documented benchmarks.
  2. Others have benchmarks that are specific to their fine-tuning tasks, but these don’t overlap across all models.
  3. The models share base models (e.g., DistilBERT, RoBERTa), but it feels like reusing the base models’ benchmarks might not align with my goal.

My Questions:

  1. Would it make sense to evaluate the fine-tuned models on the benchmarks of their base models, or is this approach flawed for emotion analysis tasks?
  2. Should I focus on choosing a smaller set of models with entirely different base models to ensure diversity in evaluation?
  3. How would you recommend selecting 3–4 benchmarks that are suitable for comparing models fine-tuned for diverse tasks (e.g., general sentiment, social media, or domain-specific emotion analysis)?

My goal is to compare these models effectively for emotion analysis tasks while maintaining scientific rigor. Any suggestions on benchmarks or how to approach this would be greatly appreciated!
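For context, here is roughly the kind of comparison I have in mind: a minimal sketch that runs each candidate checkpoint on one shared emotion dataset with the `transformers` pipeline, `datasets`, and `evaluate`, and reports a common metric. The two checkpoints and the `dair-ai/emotion` dataset below are illustrative placeholders rather than my actual shortlist, and the label mapping between each model and the dataset is an assumption that would still need manual alignment.

```python
# Minimal sketch: evaluate several fine-tuned emotion models on one shared dataset.
# NOTE: the checkpoints and the "dair-ai/emotion" dataset are placeholders, not my
# shortlist; label schemes differ across models, so the mapping below is an assumption.
from datasets import load_dataset
from transformers import pipeline
import evaluate

candidate_models = [
    "bhadresh-savani/distilbert-base-uncased-emotion",  # placeholder checkpoint
    "j-hartmann/emotion-english-distilroberta-base",    # placeholder checkpoint
]

dataset = load_dataset("dair-ai/emotion", split="test")
label_names = dataset.features["label"].names  # e.g. ["sadness", "joy", ...]
accuracy = evaluate.load("accuracy")

for model_name in candidate_models:
    clf = pipeline("text-classification", model=model_name, truncation=True)
    preds = clf(dataset["text"], batch_size=32)
    # Map predicted label strings onto the dataset's label ids; any label the
    # dataset does not know (e.g. "neutral") becomes -1 and is counted as wrong.
    pred_ids = [
        label_names.index(p["label"].lower()) if p["label"].lower() in label_names else -1
        for p in preds
    ]
    score = accuracy.compute(predictions=pred_ids, references=dataset["label"])
    print(model_name, score)
```

Even a crude harness like this would at least make question 3 concrete for me: whichever 3–4 benchmarks I end up choosing would just need to slot into the loop above, with a per-model label mapping.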
