Sounds interesting @kees, I was in fact working on something similar: a JAX/Flax implementation of ProteinBERT. Pretraining RoBERTa seems like a good strategy. I’m definitely interested.
I think one should be able to slightly tweak https://github.com/huggingface/transformers/blob/master/examples/flax/language-modeling/run_mlm_flax.py for it to work well with BigBird. Also pinging our “BigBird” expert @vasudevgupta
Thanks @patrickvonplaten for pinging me here. Training BigBird on DNA sequences sounds so interesting (always wanted to do that). I would also be happy to work on this one.
Hi everyone, this project seems very interesting and I would love to join. If you want to know a little more about my background, check out my GitHub; I have worked on a somewhat similar problem.
Hey @valhalla, I’m still not sure if I want to be a part of this project or the ProteinBERT project. I’d rather be a part of ProteinBERT, in case it gets accepted.
Do the attention patterns that BigBird uses make any biological sense, or could they deteriorate performance for some tasks?
I mean, BigBird introduces sparsity into the model using different attention patterns (see figure below) that make a certain linguistic sense. However, I don’t know if they can be easily extrapolated to relationships between k-mers.
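To make the question a bit more concrete, here is a rough sketch of what those patterns look like over k-mer tokens. It is a token-level simplification (the actual BigBird implementation works on blocks of tokens), and the function names, the k-mer size, and the window/global/random counts are all illustrative assumptions, not values from the paper or from anyone's experiments in this thread.

```python
import random


def kmer_tokens(seq, k=3, stride=1):
    """Split a DNA string into overlapping k-mers (hypothetical helper)."""
    return [seq[i:i + k] for i in range(0, len(seq) - k + 1, stride)]


def sparse_attention_mask(n, window=1, num_global=1, num_random=1, seed=0):
    """Token-level boolean mask combining the three BigBird-style patterns.

    mask[i][j] is True when query token i may attend to key token j.
    All sizes here are assumed values chosen only for illustration.
    """
    rng = random.Random(seed)
    mask = [[False] * n for _ in range(n)]
    for i in range(n):
        # Sliding window: each token sees its local neighbours.
        for j in range(max(0, i - window), min(n, i + window + 1)):
            mask[i][j] = True
        # Global tokens: the first num_global tokens see, and are seen by, everyone.
        for g in range(min(num_global, n)):
            mask[i][g] = True
            mask[g][i] = True
        # Random attention: a few arbitrary keys per query.
        for j in rng.sample(range(n), min(num_random, n)):
            mask[i][j] = True
    return mask


tokens = kmer_tokens("ACGTACGTAC", k=3)
mask = sparse_attention_mask(len(tokens))
for tok, row in zip(tokens, mask):
    print(tok, "".join("x" if allowed else "." for allowed in row))
```

Whether a local window plus a handful of global and random links captures long-range dependencies between k-mers is exactly the open question; the sketch only shows which pairs the model would be allowed to look at.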
Thanks @patrickvonplaten, I didn’t know if one could participate in two projects. I’ve done some work on ProteinBERT already, so working on BigBirdDNA would be a learning opportunity. Although I’m not sure how much time I’ll be able to give to two projects. For now I’ll stick to ProteinBERT.
I didn’t train and compare both of them for DNA sequences.
However, the main problem is that DNA sequences are extremely long, and the only option for training on them is to use an efficient transformer.
Currently, AFAIK, the only efficient transformer that was ported to Trax is BigBird.
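As a back-of-the-envelope illustration of why sequence length matters so much here, the sketch below counts query-key pairs for full attention versus a block-sparse scheme. The block size and the number of window, global, and random blocks are assumed values picked for illustration, and the function names are hypothetical.

```python
def full_attention_pairs(n):
    """Full self-attention scores every query against every key."""
    return n * n


def sparse_attention_pairs(n, window=3, num_global=2, num_random=3, block=64):
    """Rough pair count when each query attends to a fixed number of key blocks.

    The defaults are illustrative assumptions, not tuned settings.
    """
    keys_per_query = (window + num_global + num_random) * block
    # A query can never attend to more keys than the sequence contains.
    return n * min(n, keys_per_query)


for n in (512, 4096, 32768):
    full = full_attention_pairs(n)
    sparse = sparse_attention_pairs(n)
    print(f"n={n}: full={full}, sparse={sparse}, ratio={full / sparse:.1f}")
```

The sparse count grows linearly with sequence length once the sequence is longer than the fixed number of keys per query, while the full count grows quadratically, so the gap widens as sequences get longer.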
Hey @patrickvonplaten, I’m almost done with my set of contributions to the ProteinBERT project. My work was mostly focused on creating the model architecture, and I’ve almost completed the work. I was wondering if I could join the BigBird team as well. I know it’s kinda late but if it’s okay with you guys and the members of the group, I’d love to contribute to the BigBirdDNA project as well.
It would be awesome if you can jump in. I am a little bit in over my head with other tasks at the moment, so the project has not made a lot of progress just yet.