How to implement learnable position embed?

davidshen84 · July 16, 2023, 6:18am

Hi,

In the ViT paper, the authors say they use standard learnable 1D position embeddings. I want to implement this in Flax.

If the embeddings are initialized randomly, I think I can just use the nn.Embed class to create them. But how do I apply the embeddings to the inputs?

This is how I think it could be done. I wonder if it makes any sense.

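from typing import Any

import jax
import jax.numpy as jnp
import flax.linen as nn


class PositionEmbed(nn.Module):
    dtype: Any = jnp.float32

    @nn.compact
    def __call__(self, x):
        '''
        x: [N, L, D]
        '''
        # Look up one learnable D-dimensional vector for each position 0..L-1.
        embed = nn.Embed(x.shape[1], x.shape[-1], dtype=self.dtype)(jnp.arange(x.shape[1]))

        # embed is [L, D]; vmap adds it to every item along the batch axis.
        # Plain broadcasting (x + embed) would give the same result.
        batch_apply = jax.vmap(lambda x_: x_ + embed)

        return batch_apply(x)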
