I have many trajectories
[[x1, y1, z1], [x2, y2, z2], ... [xn, yn, zn]] of objects I've been tracking in imaging. Some of the time points
[xi, yi, zi] are missing, and I'd like to impute those coordinates
[x_hat, y_hat, z_hat] - a problem that strikes me as very similar to masked language modeling!
Conceptually the transformer makes sense to me, but I'm stuck on what seems like the most trivial step: can numerical values be used as input to a transformer like BERT?
- I don't need to "tokenize" my input, and that is part of my confusion.
- Another part is that my problem has no "vocabulary", since the outputs are continuous values (normalized between 0 and 1) rather than discrete tokens.
- Do I have to use a special architecture (e.g. BEiT)?
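To make the question concrete, here is a minimal sketch of what I imagine in PyTorch: a linear projection of each (x, y, z) point replaces the token-embedding lookup, a learned vector replaces the [MASK] token, and a small regression head replaces the vocabulary softmax. All the class and variable names here are my own invention, not from any library, and I'm not sure this is the standard way to do it.

```python
import torch
import torch.nn as nn

class TrajectoryImputer(nn.Module):
    """BERT-style encoder over continuous (x, y, z) points.
    A linear layer stands in for the token-embedding lookup,
    and a regression head stands in for the vocabulary softmax.
    Hypothetical sketch, not a reference implementation."""
    def __init__(self, d_model=64, nhead=4, num_layers=2, max_len=512):
        super().__init__()
        self.input_proj = nn.Linear(3, d_model)               # continuous "embedding"
        self.mask_token = nn.Parameter(torch.zeros(d_model))  # learned [MASK] vector
        self.pos = nn.Parameter(torch.randn(max_len, d_model) * 0.02)  # learned positions
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers)
        self.head = nn.Linear(d_model, 3)                     # regress x_hat, y_hat, z_hat

    def forward(self, coords, missing):
        # coords: (batch, seq_len, 3); missing: (batch, seq_len) bool mask
        h = self.input_proj(coords)
        h[missing] = self.mask_token          # overwrite missing time points
        h = h + self.pos[: coords.size(1)]    # positional information
        return self.head(self.encoder(h))

# Toy usage: one trajectory of 10 points with 2 missing time points.
coords = torch.rand(1, 10, 3)                 # normalized to [0, 1]
missing = torch.zeros(1, 10, dtype=torch.bool)
missing[0, [3, 7]] = True
model = TrajectoryImputer()
pred = model(coords, missing)                 # (1, 10, 3)
loss = ((pred - coords)[missing] ** 2).mean() # MSE on masked positions only
```

The training loop would then mask known points at random (BERT-style) so the loss can be computed against ground truth, and at inference the genuinely missing points get the mask vector. Is this linear-projection approach the right way to feed numerical values in, or is something more like BEiT's discretization actually needed?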