Wav2Vec2Model: Expected 3-dimensional input for 3-dimensional weight [512, 10, 10], but got 4-dimensional input of size [16, 1, 10, 1000] instead

I’m trying to use Wav2Vec2Model with multi-channel input. To do this, I have replaced the first layer of the feature_extractor in Wav2Vec2Model.

Code:

import torch
import torch.nn as nn
from transformers import Wav2Vec2Model, Wav2Vec2Config

configuration = Wav2Vec2Config()
model = Wav2Vec2Model(configuration)

# Replace the first conv layer so it accepts 10 input channels instead of 1
model.feature_extractor.conv_layers[0].conv = nn.Conv1d(10, 512, kernel_size=10, stride=5, bias=False)

inputs = torch.rand(16, 10, 1000)
out = model(inputs)

Input Shape: (16, 10, 1000), where 16 is the batch size, 10 is the number of channels, and 1000 is the sequence length.

Error:

RuntimeError: Expected 3-dimensional input for 3-dimensional weight [512, 10, 10], but got 4-dimensional input of size [16, 1, 10, 1000] instead
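As far as I can tell, the extra dimension comes from the model itself: Wav2Vec2's feature encoder assumes `input_values` has shape (batch, length) and inserts the channel axis internally via `input_values[:, None]`. A 3-D multi-channel tensor therefore becomes 4-D before it ever reaches the replaced Conv1d, which is a minimal sketch of what seems to happen:

```python
import torch

# Wav2Vec2's feature encoder expects (batch, length) and adds the
# channel axis itself, roughly equivalent to input_values[:, None].
x = torch.rand(16, 10, 1000)   # (batch, channels, length) -- my input
x_unsqueezed = x[:, None]      # what the model effectively does internally
print(x_unsqueezed.shape)      # torch.Size([16, 1, 10, 1000]) -- 4-D, hence the error
```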

Any solutions?
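One workaround I have considered (a sketch, not a verified fix) is to leave Wav2Vec2Model unmodified and instead learn a linear mix of the 10 channels down to mono with a 1x1 Conv1d, so the standard (batch, length) interface still applies; the `channel_mixer` name here is my own, not part of the library:

```python
import torch
import torch.nn as nn

# Hypothetical front-end: mix 10 input channels down to a single channel
# so the unmodified Wav2Vec2Model interface, (batch, length), can be used.
channel_mixer = nn.Conv1d(10, 1, kernel_size=1, bias=False)

x = torch.rand(16, 10, 1000)        # (batch, channels, length)
mono = channel_mixer(x).squeeze(1)  # (batch, length) -- what the model expects
print(mono.shape)                   # torch.Size([16, 1000])
```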


this problem seems to be related to