Sparse Transcoders for Qwen2.5-VL-7B

Hi everyone,

I’ve released sparse transcoders for Qwen2.5-VL-7B-Instruct on HuggingFace.

KokosDev/qwen2p5vl-7b-plt · Hugging Face

Technical specs:

  • 28 transcoders (one per decoder layer)
  • 8,192 features per layer
  • Cross-layer predictive architecture (PLT)
  • ~10% L0 sparsity
  • Apache 2.0 licensed
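
To make the specs concrete, here is a minimal sketch of a sparse transcoder forward pass with ~10% L0 sparsity: encode a hidden state into a wide feature vector, keep only the top-k activations, and decode back to the hidden dimension. All names, shapes, and the top-k mechanism are illustrative assumptions, not the released checkpoints' actual parameters or code.

```python
import numpy as np

def sparse_transcoder(x, W_enc, b_enc, W_dec, b_dec, l0_frac=0.10):
    """Illustrative sparse transcoder forward pass (not the repo's API).

    Encodes a hidden state into a wide feature vector, keeps only the
    top-k activations (k = l0_frac * n_features, matching the ~10% L0
    sparsity above), and decodes back to the hidden dimension.
    """
    pre = x @ W_enc + b_enc              # (n_features,) pre-activations
    acts = np.maximum(pre, 0.0)          # ReLU
    k = max(1, int(l0_frac * acts.size))
    # Zero everything except the k largest activations.
    thresh = np.partition(acts, -k)[-k]
    feats = np.where(acts >= thresh, acts, 0.0)
    recon = feats @ W_dec + b_dec        # back to hidden dim
    return feats, recon

# Toy dimensions: hidden size 16, 64 features (the release uses 8,192).
rng = np.random.default_rng(0)
d, n = 16, 64
W_enc = rng.normal(size=(d, n)); b_enc = np.zeros(n)
W_dec = rng.normal(size=(n, d)); b_dec = np.zeros(d)
x = rng.normal(size=d)
feats, recon = sparse_transcoder(x, W_enc, b_enc, W_dec, b_dec)
print((feats != 0).sum())  # at most k = 6 active features
```

The actual transcoders are trained (the sparsity mechanism and decoder target depend on the training setup); this toy version only shows the shape of the computation.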

Use cases:

  • Mechanistic interpretability
  • Feature discovery
  • Circuit analysis
  • Model steering research
  • Feature suppression or amplification
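
The suppression/amplification use case can be sketched generically: intervene on chosen feature activations before the decoder reconstructs the hidden state. Again, names and shapes here are hypothetical, not the repo's interface.

```python
import numpy as np

def steer_features(feats, W_dec, b_dec, edits):
    """Illustrative feature steering sketch (not the repo's API).

    `edits` maps feature index -> multiplier: 0.0 suppresses a feature,
    values > 1.0 amplify it. The edited features are then decoded back
    into the model's hidden dimension.
    """
    steered = feats.copy()
    for idx, scale in edits.items():
        steered[idx] *= scale
    return steered @ W_dec + b_dec

# Toy setup: 64 features, hidden size 16, two active features.
rng = np.random.default_rng(1)
n, d = 64, 16
W_dec = rng.normal(size=(n, d)); b_dec = np.zeros(d)
feats = np.zeros(n); feats[3] = 2.0; feats[10] = 1.5
# Suppress feature 3 entirely, amplify feature 10 by 4x.
out = steer_features(feats, W_dec, b_dec, {3: 0.0, 10: 4.0})
```

In practice the steered reconstruction would be substituted back into the model's residual stream at the corresponding layer.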

All code and documentation are included in the repo.

Next: training the 32B version (64 layers, 12K features)!

Questions and feedback welcome.


Welcome to the forum, @KokosDev!

Congrats on the release.
