Decoding a CLIP embedding

fredguth · October 29, 2022, 7:11pm

I can transform a text prompt into CLIP embeddings with:

prompt -> tokenizer -> tokens -> CLIPTextModel.from_pretrained -> embeddings
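
For anyone landing here, the forward direction above might look roughly like this with the transformers library. This is only a sketch: the checkpoint name and the `encode_prompt` helper are my own assumptions, not something from the original post, and any CLIP text checkpoint should work the same way.

```python
import torch
from transformers import CLIPTextModel, CLIPTokenizer

# Assumption: any CLIP checkpoint that ships a text model works here.
MODEL_NAME = "openai/clip-vit-large-patch14"


def encode_prompt(prompt, model_name=MODEL_NAME):
    """Hypothetical helper: prompt -> token ids -> per-token embeddings."""
    tokenizer = CLIPTokenizer.from_pretrained(model_name)
    text_encoder = CLIPTextModel.from_pretrained(model_name)
    text_encoder.eval()

    # prompt -> tokens (padded/truncated to the model's context length)
    tokens = tokenizer(
        prompt,
        padding="max_length",
        max_length=tokenizer.model_max_length,
        truncation=True,
        return_tensors="pt",
    )

    # tokens -> embeddings (one vector per token position)
    with torch.no_grad():
        output = text_encoder(input_ids=tokens.input_ids)

    return tokens.input_ids, output.last_hidden_state


if __name__ == "__main__":
    ids, embeddings = encode_prompt("a photograph of an astronaut riding a horse")
    print(ids.shape, embeddings.shape)
```

Running the script downloads the checkpoint on first use. `last_hidden_state` is the per-token output; `output.pooler_output` gives a single vector per prompt if that is what you need.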

I would like to decode an embedding to a prompt:

embeddings -> ??? -> tokens -> tokenizer -> prompt

How do I convert CLIP embeddings into tokens?

fredguth · November 1, 2022, 3:08pm

I figured out that CLIPTextModel is a lossy encoder, so there is no direct way to decode an embedding back into tokens.
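
To illustrate what "lossy" means in practice, here is a small self-contained sketch. It is not CLIP and not a real decoder: `toy_encode` is a made-up stand-in encoder (letter counts) that, like any lossy encoder, maps different texts to the same vector. The hypothetical `rank_candidates` helper shows one possible workaround, which is to encode a list of candidate prompts and keep the one closest to the target embedding. That is an approximate search, not an inversion.

```python
import math
import string


def toy_encode(prompt):
    """Made-up lossy encoder (NOT CLIP): counts of each letter a-z."""
    text = prompt.lower()
    return [text.count(ch) for ch in string.ascii_lowercase]


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_candidates(target, candidates, encode):
    """Hypothetical helper: sort candidate prompts by similarity to target."""
    scored = [(cosine_similarity(target, encode(p)), p) for p in candidates]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


if __name__ == "__main__":
    # Two different texts, one embedding: the original cannot be recovered.
    print(toy_encode("listen") == toy_encode("silent"))

    # Approximate "decoding" by searching over known candidates.
    target = toy_encode("a cat")
    for score, prompt in rank_candidates(
        target, ["a dog", "a cat", "the sea"], toy_encode
    ):
        print(f"{score:.3f}  {prompt}")
```

With a real CLIP text encoder you would pass a function that returns the flattened embedding as a list of floats in place of `toy_encode`. The result can only ever be as good as the candidate list you search over.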
