Sliding Transformer into a long sequence

mdelas · August 4, 2022, 2:57pm

Hello everyone,

I have very long genome sequences where I have to do some classification stuff on top. What I want to try is to use a transformer to predict the next token from the 512 chunk sequence and slide this transformer to the whole sequence and use those to work on top of the whole sequence.

Let’s say an example: Imagine I have a 250.000 token sequence, I should slide the transformer 488 times producing 488 tokens. Concatenate this output to obtain a summary array of the sequence and build a classifier on top of it.

I’m trying to find any examples that could guide me in this direction but I hardly can find any of them. Does someone think that’s a good idea? Where could I look for some near examples of sliding a transformer/LSTM over a longer sequence?

Thank you very much, I’ll appreciate everything!

rwheel · August 11, 2022, 6:47am

Hi @mdelas,

I’m not very familiar with your problem but I found a thread in the forum that seems to be similar to yours but for question answering task: Handling long text in BERT for Question Answering

mdelas · August 19, 2022, 4:25pm

Thank you very much @rwheel for your proposal! I will follow and try to figure out how can I use it. I posted and edited this question on deep learning - Sliding Transformer model into longer sequence - Stack Overflow . Do you have some other references of similar work over there?

rwheel · August 20, 2022, 8:17am

No, I haven’t read anything else about it. However, if I find something of interest, I will let you know

Have a good day.

Topic		Replies	Views
How to train with very long sequences? Beginners	2	691	May 20, 2022
How to train transformer (seq-to-seq) for very large seq? 🤗Transformers	0	251	October 4, 2021
Modeling long sequences Models	0	460	June 9, 2022
Handling long text in BERT for Question Answering Beginners	7	11951	March 10, 2022
Sequence Classification Long Documents Beginners	1	543	June 9, 2022

Sliding Transformer into a long sequence

Related topics