Advice Needed for Training an Imbalanced Dataset AI Model: lr, Epochs, and Neural Network Architecture

TPM-28 · November 26, 2023, 9:23am

I’m developing an AI model to determine whether a question is ‘asktoask’ (true) or not (false). My dataset is imbalanced, with more examples of non-‘asktoask’ questions than ‘asktoask’ questions. I would appreciate suggestions on training parameters such as the learning rate (lr), the number of epochs, and the model architecture. What strategies or tips do you recommend for training this model effectively while handling the class imbalance in the dataset? Your insights are welcome.

panigrah · November 26, 2023, 3:38pm

Does your dataset reflect the real world? I.e., are there more yeses than nos in reality too? If so, don’t change the dataset.

If not, then why not reduce your true set to reflect reality?
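A rough sketch of what "reducing the true set to reflect reality" could look like, assuming the examples are `(text, label)` pairs and that you have some estimate of the real-world share of true examples. The function name `downsample_to_rate` and the parameter `target_positive_rate` are made up for illustration; they are not from any library.

```python
import random

def downsample_to_rate(examples, target_positive_rate, seed=0):
    """Randomly drop positive examples until their share is roughly
    target_positive_rate. Negatives are never dropped.

    examples: list of (text, label) pairs, label is True or False.
    target_positive_rate: assumed real-world share of True labels (0 < r < 1).
    """
    if not 0 < target_positive_rate < 1:
        raise ValueError("target_positive_rate must be between 0 and 1")

    positives = [ex for ex in examples if ex[1]]
    negatives = [ex for ex in examples if not ex[1]]

    # Solve p / (p + n) = r for p, the number of positives to keep.
    keep = round(target_positive_rate * len(negatives) / (1 - target_positive_rate))
    keep = min(keep, len(positives))  # cannot keep more than we have

    rng = random.Random(seed)
    result = rng.sample(positives, keep) + negatives
    rng.shuffle(result)
    return result

# Hypothetical data: 40 true and 60 false examples, assumed real rate of 25%.
data = [(f"q{i}", True) for i in range(40)] + [(f"q{i}", False) for i in range(40, 100)]
balanced = downsample_to_rate(data, target_positive_rate=0.25)
```

This only ever removes positives, so it applies when the true class is over-represented relative to reality. If the dataset already matches reality, there is nothing to resample.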

TPM-28 · November 26, 2023, 4:58pm

There are more ‘no’ than ‘yes’ examples in my dataset, and in the real world as well…
So this idea still makes sense if ‘false’ outnumbers ‘true’ both in reality and in the dataset?

panigrah · November 27, 2023, 11:46am

So you shouldn’t have to do anything if the dataset is similar to the real world. Train the model and see how it performs before deciding what to do.
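When checking "how it performs" on an imbalanced dataset, plain accuracy can look fine even if the rare class is mostly missed, so it may help to look at precision and recall for the true class next to a majority-class baseline. A minimal sketch in plain Python, assuming boolean labels and predictions; `binary_report` is a hypothetical helper, not a library function.

```python
def binary_report(y_true, y_pred):
    """Precision, recall and F1 for the True class, plus accuracy and the
    accuracy of always predicting the majority class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)          # true positives
    fp = sum(1 for t, p in pairs if not t and p)      # false positives
    fn = sum(1 for t, p in pairs if t and not p)      # false negatives
    tn = sum(1 for t, p in pairs if not t and not p)  # true negatives

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0

    n_pos = tp + fn
    n_neg = tn + fp
    return {
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "accuracy": (tp + tn) / len(pairs),
        "majority_baseline": max(n_pos, n_neg) / len(pairs),
    }

# Made-up example: 2 'asktoask' questions out of 10, one found, one false alarm.
y_true = [True, True] + [False] * 8
y_pred = [True, False, True] + [False] * 7
report = binary_report(y_true, y_pred)
```

In this made-up example the accuracy equals the majority baseline while recall on the true class is only one half, which is the kind of signal that would suggest doing something about the imbalance.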
