Hey @DanielPezoa,
A 32 GB GPU should be big enough to fine-tune the model actually … do you use config.gradient_checkpointing=True
?
Also, it would be interesting at what batch_size you are able to fine-tune the model => setting the batch size to 8 in combination with gradient checkpointing should definitely work