Suggest a model to fine-tune for multi-class classification on a 32GB M1 Mac?

ratkins · March 4, 2024, 1:40pm

Can anyone suggest a model I could use to experiment with fine-tuning for multi-class classification on a 32GB RAM M1 Max MacBook Pro? I’m using the Huggingface Transformers library with the mps device so it runs on the Mac’s GPU.

I’ve tried bert-base-cased, but its token window is only 512, which isn’t enough for my use case—I need 2k-4k. I’ve tried Mixtral-8x7B-v0.1, but it hangs halfway through the training run. I just tried longformer-base-4096 but it runs out of memory.

Can someone suggest something that has a chance of working?

Edit: if it matters, there are a lot of classes—around 160.

Boltuzamaki · March 4, 2024, 1:51pm

If your usecase is related to multi-class classification then I think you should try setfit. Hugging Face have a very good starter blog for this

you can go with

sentence-transformers/paraphrase-mpnet-base-v2

this model

Topic		Replies	Views
Running transformer models on mps instead of cpu on mac Beginners	1	1770	January 18, 2025
Continuous Learning Using Trainer for BERT multi-class model Beginners	0	349	April 19, 2023
How to Efficiently Fine-Tune Models on Custom Datasets with Limited Resources? Beginners	0	120	July 10, 2024
Fine-tuning BERT with multiple classification heads 🤗Transformers	10	5568	January 19, 2024
How to Optimize Fine-tuning in Hugging Face Transformers? Beginners	0	335	March 5, 2024

Suggest a model to fine-tune for multi-class classification on a 32GB M1 Mac?

Related topics