Hi there,
I’m trying to convert a modified BERT embedding back to text.
I’m shooting for a pipeline like the following:
1. input_text
2. Tokenizer
3. BERT Base Uncased
4. Add noise to the output tensor
5. Attempt to reconstruct the original text
I’m somewhat lost on step 5. I have a perturbed torch tensor, but how do I convert it back to a sentence? I’m guessing I need some kind of logit layer that uses the same vocabulary as the tokenizer?
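To make step 5 concrete, here is a toy, framework-free sketch of what I imagine the reconstruction looks like. Everything in it is a stand-in I made up for illustration: the four-word `VOCAB`, the one-hot `EMBEDDINGS`, and the helper names (`encode`, `embed`, `add_noise`, `decode`) are hypothetical, not BERT's real vocabulary, weights, or API. It assumes the "logit layer" is just a linear projection of each vector onto a vocabulary-sized matrix followed by an argmax. With the real model I would presumably need a trained head (perhaps something like the masked-LM head in `BertForMaskedLM`) plus `tokenizer.decode`, but that part is a guess on my side.

```python
import random

# Toy stand-ins (hypothetical): a 4-word vocabulary with one-hot "embeddings".
VOCAB = ["[PAD]", "hello", "world", "bert"]
EMBEDDINGS = [
    [1.0 if i == j else 0.0 for j in range(len(VOCAB))]
    for i in range(len(VOCAB))
]

def encode(text):
    """Whitespace 'tokenizer': map each word to its vocabulary id."""
    return [VOCAB.index(word) for word in text.split()]

def embed(token_ids):
    """Look up one vector per token id (stands in for the model output)."""
    return [list(EMBEDDINGS[i]) for i in token_ids]

def add_noise(vectors, scale, rng):
    """Perturb every component with uniform noise in [-scale, scale]."""
    return [[x + rng.uniform(-scale, scale) for x in vec] for vec in vectors]

def decode(vectors):
    """Project each vector onto the vocabulary (dot product with every
    embedding row gives one logit per word), then take the argmax."""
    ids = []
    for vec in vectors:
        logits = [sum(h * e for h, e in zip(vec, row)) for row in EMBEDDINGS]
        ids.append(max(range(len(logits)), key=logits.__getitem__))
    return " ".join(VOCAB[i] for i in ids)

rng = random.Random(0)  # fixed seed so the run is repeatable
noisy = add_noise(embed(encode("hello bert world")), scale=0.1, rng=rng)
reconstructed = decode(noisy)
print(reconstructed)
```

With noise this small relative to the spacing between the toy vectors, the argmax lands back on the original words. What I don't know is what plays the role of `decode` for BERT's contextual hidden states.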
Thanks in advance!