Predict next embedding given sequence of embeddings

samhedin · February 16, 2023, 2:04pm

Hi!
I’m planning a network that will take a sequence of embedding vectors as input, and produce several vectors as output.
Given a sequence of text embeddings produced by BERT, I want to predict the next embedding.
Example input/output:

# The user first read text with embedding a,
# and then read text with embedding b.
input: [[a1, a2, a3], 
        [b1, b2, b3], ...]

# The network predicts that they will want to read something similar to x or y.
# Once x and y are produced, I will search a database of
# [text <> embedding] pairs to find relevant text.
output: {[x1, x2, x3], 
         [y1, y2, y3], ...}
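
To make the lookup step concrete, here is a minimal sketch of searching the [text <> embedding] pairs by cosine similarity. The function names `cosine_similarity` and `nearest_texts`, the toy 3-dimensional vectors, and the example texts are all made up for illustration; a real database would hold full-size sentence embeddings and would probably use a vector index rather than a linear scan.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_u * norm_v)

def nearest_texts(predicted, database, top_k=1):
    """Return the top_k texts whose embeddings are closest to `predicted`.

    `database` is a list of (text, embedding) pairs.
    """
    scored = [(cosine_similarity(predicted, emb), text) for text, emb in database]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:top_k]]

# Toy data (hypothetical): three texts with 3-dimensional embeddings.
database = [
    ("article about cooking", [1.0, 0.0, 0.0]),
    ("article about football", [0.0, 1.0, 0.0]),
    ("article about baking", [0.9, 0.1, 0.0]),
]

x = [0.8, 0.2, 0.0]  # stands in for one predicted output vector
matches = nearest_texts(x, database, top_k=2)
print(matches)
```

Each predicted vector (x, y, ...) would be passed through `nearest_texts` separately, and the results merged.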

The input vectors are produced by Sentence-BERT: https://www.sbert.net/
Is HF a natural fit for this task? If so, where should I start?
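
For reference, here is a rough sketch of the kind of network I have in mind, written in plain PyTorch rather than with any particular Hugging Face model class. Everything in it is an assumption for illustration: the class name `NextEmbeddingPredictor`, the GRU encoder, the embedding size of 384, the choice of two output vectors, and the "best candidate" cosine loss are just one possible setup, and the random tensors stand in for real Sentence-BERT embeddings.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NextEmbeddingPredictor(nn.Module):
    """Reads a sequence of embeddings and emits `num_outputs` candidate next embeddings."""

    def __init__(self, embed_dim, hidden_dim=128, num_outputs=2):
        super().__init__()
        self.embed_dim = embed_dim
        self.num_outputs = num_outputs
        self.encoder = nn.GRU(embed_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, num_outputs * embed_dim)

    def forward(self, history):
        # history: (batch, seq_len, embed_dim)
        _, last_hidden = self.encoder(history)   # (1, batch, hidden_dim)
        flat = self.head(last_hidden[-1])        # (batch, num_outputs * embed_dim)
        return flat.view(-1, self.num_outputs, self.embed_dim)

torch.manual_seed(0)
embed_dim = 384  # assumed size; use whatever the embedding model outputs
model = NextEmbeddingPredictor(embed_dim)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Random stand-ins: 4 users, each with 5 previously read texts.
history = torch.randn(4, 5, embed_dim)
target = torch.randn(4, embed_dim)  # embedding of the text actually read next

predictions = model(history)  # (4, 2, embed_dim)

# Cosine distance from each candidate to the target; train only the closest
# candidate per example so the outputs are free to cover different options.
expanded_target = target.unsqueeze(1).expand_as(predictions)
similarity = F.cosine_similarity(predictions, expanded_target, dim=-1)  # (4, 2)
loss = (1.0 - similarity).min(dim=1).values.mean()

optimizer.zero_grad()
loss.backward()
optimizer.step()
```

The GRU could be swapped for a Transformer encoder over the embedding sequence; the input and output shapes would stay the same.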
