Fine-tuning a model with smaller sequence length and d_model

Hi all !

I wanted to know if there is any way to fine-tune a pretrained model but with smaller dimensions (for example: sequence length reduced from 512 to 64 and d_model from 768 to 256).

I could use the PyTorch TransformerEncoder and train my own model from scratch, but I wanted to know if some form of fine-tuning is possible (i.e., not starting from scratch).
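For reference, here is roughly what the from-scratch option would look like. This is only a minimal sketch assuming a BERT-style vocabulary; d_model=256 and a maximum sequence length of 64 match the sizes mentioned above, while the vocabulary size, layer count, and head count are just illustrative placeholders, not values tied to any particular pretrained checkpoint.

```python
import torch
import torch.nn as nn

class SmallEncoder(nn.Module):
    # Hypothetical small encoder: d_model=256, max sequence length 64.
    def __init__(self, vocab_size=30522, d_model=256, nhead=4,
                 num_layers=4, max_len=64):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        self.pos_emb = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)

    def forward(self, input_ids):
        # input_ids: (batch, seq_len) with seq_len <= max_len
        positions = torch.arange(input_ids.size(1), device=input_ids.device)
        x = self.tok_emb(input_ids) + self.pos_emb(positions)
        return self.encoder(x)

model = SmallEncoder()
dummy = torch.randint(0, 30522, (2, 64))  # batch of 2, sequence length 64
out = model(dummy)                         # shape (2, 64, 256)
```

The downside is that all of these weights start from random initialization, which is exactly what I'd like to avoid.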

Thanks in advance for any help,

Have a great day !