Extracting logits from vision language models at inference time

Hello, is there a simple way to extract the logits of the output token(s) when running inference with a VLM that is natively integrated into transformers (e.g. via the pipeline or Auto classes)? For instance, I would expect to be able to access them through outputs.logits in the forward pass. Is this doable with currently integrated models such as LLaVA or CogVLM?
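
For reference, here is roughly what I have in mind (a minimal sketch, not verified end to end, assuming the llava-hf/llava-1.5-7b-hf checkpoint, a local example.jpg, and the standard generate() keyword arguments):

```python
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("example.jpg")  # placeholder image path
prompt = "USER: <image>\nWhat is shown in this picture? ASSISTANT:"
inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)

# Option 1: a plain forward pass, which I would expect to expose
# logits for every input position.
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.logits.shape)  # (batch, sequence_length, vocab_size)?

# Option 2: during generation, ask generate() to return per-step scores.
generated = model.generate(
    **inputs,
    max_new_tokens=20,
    return_dict_in_generate=True,
    output_scores=True,
)
# generated.scores should be a tuple with one (batch, vocab_size)
# tensor per generated token.
print(len(generated.scores), generated.scores[0].shape)
```

Is this the intended way to do it, or is there a simpler/officially supported path for VLMs?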