Using transformers (BERT, RoBERTa) without embedding layer

Hi @tueboesen,

Yes, it will work. It can give you results very close to MSA-based methods, sometimes even better. If you combine it with MSA, it will give you even better results than MSA methods alone.

We have trained Transformer-XL, XLNet, BERT, ALBERT, ELECTRA, and T5 on the Uniref100 and BFD datasets. I would recommend simply using one of these models, because reaching good results requires a tremendous amount of computing power.
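For example, one of these checkpoints can be loaded straight from the Hugging Face Hub with the `transformers` library. This is just a minimal sketch; it assumes the ProtBert checkpoint name `Rostlab/prot_bert` and the usual space-separated amino-acid input format, so adjust it to whichever of the models you pick:

```python
import re
import torch
from transformers import BertModel, BertTokenizer

# Assumed checkpoint name for the ProtTrans BERT model on the Hugging Face Hub
model_name = "Rostlab/prot_bert"

tokenizer = BertTokenizer.from_pretrained(model_name, do_lower_case=False)
model = BertModel.from_pretrained(model_name)
model.eval()

# ProtBert-style input: residues separated by spaces, rare amino acids mapped to X
sequence = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
sequence = " ".join(re.sub(r"[UZOB]", "X", sequence))

inputs = tokenizer(sequence, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Per-residue embeddings: (batch, sequence_length, hidden_size)
residue_embeddings = outputs.last_hidden_state
# A simple per-protein embedding: mean over the residue positions
protein_embedding = residue_embeddings.mean(dim=1)
print(residue_embeddings.shape, protein_embedding.shape)
```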

You can find them here:

You can find more details on our paper:

Facebook also trained RoBERTa on the Uniref50 dataset:

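If you want to try that model instead, it can be loaded through Facebook's fair-esm package. The checkpoint name and API in this sketch follow that repository's documented usage and are assumptions on my side, not part of our pipeline:

```python
import torch
import esm  # pip install fair-esm (assumed package name)

# ESM-1b checkpoint trained on Uniref50 (assumed name from the fair-esm repo)
model, alphabet = esm.pretrained.esm1b_t33_650M_UR50S()
batch_converter = alphabet.get_batch_converter()
model.eval()

data = [("protein1", "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ")]
labels, strs, tokens = batch_converter(data)

with torch.no_grad():
    results = model(tokens, repr_layers=[33])

# Per-residue representations from the final (33rd) layer
token_representations = results["representations"][33]
print(token_representations.shape)
```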
Unfortunately, we don’t have a notebook for training from scratch, but you can find more details on how to replicate our results here:

@patrickvonplaten:
You meant:

Not:

:slight_smile:

ProtTrans: Provides SOTA pre-trained models for protein sequences.
CodeTrans: Provides SOTA pre-trained models for computer source code.
