Hi,
Here’s how you can visualize the output of ViTImageProcessor:
from transformers import ViTImageProcessor
import requests
from PIL import Image
image_processor = ViTImageProcessor()
url = 'http://images.cocodataset.org/val2017/000000039769.jpg'
image = Image.open(requests.get(url, stream=True).raw)
pixel_values = image_processor(image, return_tensors="pt").pixel_values
# denormalize the pixel values for visualization purposes
mean = image_processor.image_mean
std = image_processor.image_std
unnormalized_image = (pixel_values[0].numpy() * np.array(std)[:, None, None]) + np.array(mean)[:, None, None]
unnormalized_image = (unnormalized_image * 255).astype(np.uint8)
unnormalized_image = np.moveaxis(unnormalized_image, 0, -1)
unnormalized_image = Image.fromarray(unnormalized_image)
which gives me this:
This is a 224x224 image.