Output of Pyramid Vision Transformer

Sushmitaupadhyay, May 3, 2024, 4:25am

What does the output of PVTModel represent? Is it a sequence of image patch embeddings, as in ViT, or a spatial feature map, as in a CNN?
