How to Fine-tune Rostlab/prot_t5_xl_uniref50 Model for Sequence Generation

littleworth · April 5, 2023, 12:00pm

Dear Sir,

I’d like to fine-tune the pre-trained model Rostlab/prot_t5_xl_uniref50.

The dataset I plan to use for fine tuning looks like this (colon separated):

P R T <extra_id_0> I N S E Q W <extra_id_1> E N C E :  P R T K I N S E Q W H E N C E 
G M M  <extra_id_0>  <extra_id_1> K P H G : G M M V E K P H G
R H G L  <extra_id_0>  <extra_id_1> : R H G L Q F

...etc...

The first column is the input from a user and the output is on the right.
The output is simply the ‘active’ protein inferred from experiments.

Unfortunately, I have been unable to find a suitable example in the ProtTrans repository.

Additionally, I am curious if the Hugging Face script run_mlm.py can be utilized with the aforementioned pre-trained model.

I would truly appreciate your insights and recommendations on how to proceed with this task. Thank you in advance for your time and consideration.

Rgds,
littleworth

Topic		Replies	Views
Need help in fine-tuning T5-Base Model for a sequence task Beginners	0	168	May 8, 2024
E5 embedding models 🤗Transformers	1	19	March 17, 2025
Finetuning T5 for Summarisation - Poor results Intermediate	1	528	April 28, 2024
How to do sequence fine tuning? Beginners	5	740	July 22, 2020
Errors when fine-tuning T5 Beginners	7	6469	January 3, 2022

How to Fine-tune Rostlab/prot_t5_xl_uniref50 Model for Sequence Generation

Related topics