Wrong tensor shape when using a model: TypeError: Cannot handle this data type: (1, 1, 1280, 3), |u1

Hi,

VideoMAE models are trained with a fixed number of frames, which you can check via model.config.num_frames. This is typically 16 or 32. In other words, you'll need to sample 16 or 32 frames from the video and provide those to the model. See the sketch below for one way to sample frames.
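A minimal sketch, assuming decord for video decoding and the MCG-NJU/videomae-base-finetuned-kinetics checkpoint purely as an illustration (the video path is a placeholder); the image processor then handles resizing, normalization, and channel ordering:

```python
import numpy as np
from decord import VideoReader, cpu
from transformers import VideoMAEImageProcessor, VideoMAEForVideoClassification

model_name = "MCG-NJU/videomae-base-finetuned-kinetics"  # example checkpoint
processor = VideoMAEImageProcessor.from_pretrained(model_name)
model = VideoMAEForVideoClassification.from_pretrained(model_name)

num_frames = model.config.num_frames  # typically 16 or 32

# Sample `num_frames` evenly spaced frames from the video.
vr = VideoReader("video.mp4", ctx=cpu(0))  # placeholder path
indices = np.linspace(0, len(vr) - 1, num=num_frames).astype(int)
frames = list(vr.get_batch(indices).asnumpy())  # list of (H, W, 3) uint8 arrays

inputs = processor(frames, return_tensors="pt")  # pixel_values: (1, num_frames, 3, H, W)
outputs = model(**inputs)
```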

Also, the channels come first rather than last: pixel_values should have shape (batch_size, num_frames, num_channels, height, width). This is documented here: VideoMAE.
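If you build pixel_values yourself instead of going through the image processor, a minimal sketch of the expected channels-first layout (assuming a 224x224 checkpoint with 16 frames) looks like this:

```python
import numpy as np
import torch

frame_hwc = np.zeros((224, 224, 3), dtype=np.uint8)       # (H, W, C), as read from a video file
frame_chw = torch.from_numpy(frame_hwc).permute(2, 0, 1)   # (C, H, W), channels first

# Stack 16 such frames and add a batch dimension.
pixel_values = torch.stack([frame_chw] * 16).unsqueeze(0).float()
print(pixel_values.shape)  # torch.Size([1, 16, 3, 224, 224])
```

The error in the title typically means the frames were left in (height, width, channels) order, so permuting to channels first (or letting the processor do it) resolves it.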
