I wanted to make a new language model for protein sequences because i was getting good accuracy on my classification task when I use word2vec. I am not able to get any example on this as most of them are for
fill-mask. Is there an example I can follow for sentence classification? I am able to complete the tokenizing step and just have some issues with the training step.