Grouphug: multi-task, multi-dataset training with πŸ€— transformers/datasets

I recently released grouphug - a package optimized for training on multiple datasets/dataframes at once, with each containing an arbitrary subset of tasks, built on πŸ€— transformers/datasets.

The need for this came from wanting to predict many closely related things about a message (topic, sentiment, toxicity, etc.) with the inference speed of a single model, and with better accuracy than training separate models.
I have also found that co-training on a masked language modelling task results in models which generalize very well and are much less prone to overfitting.

Even for single-task modelling, the classification head is a good deal more powerful than the usual default, and the dataset formatter can quickly turn your dataframes into the format the trainer expects; both appear in the sketch below.
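
To give a feel for the API, here is a rough sketch of training on two data sources that share a task, with per-task classification heads plus a masked language modelling head for co-training. Treat it as illustrative: the names used here (`DatasetFormatter`, `ClassificationHeadConfig`, `LMHeadConfig`, `AutoMultiTaskModel`, `MultiTaskTrainer`) and their exact signatures should be checked against the README.

```python
# Illustrative sketch; see the grouphug README for the exact, current API.
import pandas as pd
from datasets import load_dataset
from transformers import AutoTokenizer

from grouphug import (
    AutoMultiTaskModel,
    ClassificationHeadConfig,
    DatasetFormatter,
    LMHeadConfig,
    MultiTaskTrainer,
)

# Two data sources: one with a single task, one with two tasks on the same text column.
emoji = load_dataset("tweet_eval", "emoji").rename_column("label", "tweet_label")
both = pd.DataFrame(
    {
        "text": ["yay :)", "booo!"],
        "sentiment": ["pos", "neg"],
        "tweet_label": [0, 14],
    }
)

base_model = "prajjwal1/bert-tiny"
tokenizer = AutoTokenizer.from_pretrained(base_model)

# Tokenize the text column and encode string labels across all datasets in one go.
formatter = DatasetFormatter().tokenize().encode("sentiment")
data = formatter.apply({"emoji": emoji, "both": both}, tokenizer=tokenizer, test_size=0.05)

# One classification head per task, plus a (down-weighted) MLM head for co-training.
head_configs = [
    LMHeadConfig(weight=0.1),
    ClassificationHeadConfig.from_data(data, "sentiment"),
    ClassificationHeadConfig.from_data(data, "tweet_label"),
]
model = AutoMultiTaskModel.from_pretrained(base_model, head_configs, formatted_data=data)

trainer = MultiTaskTrainer(
    model=model,
    tokenizer=tokenizer,
    train_data=data[:, "train"],
    eval_data=data[["emoji"], "test"],
    eval_heads={"emoji": ["tweet_label"]},
)
trainer.train()
```

Each batch comes from a single dataset, and only the heads whose task columns are present in that dataset contribute to the loss, which is what lets every dataset carry an arbitrary subset of tasks.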

Would love to hear if this is useful for anyone else, and any suggestions you have!
