Whisper Inference RuntimeError: The expanded size of the tensor (3000) must match the existing size (3392) at non-singleton dimension 1. Target sizes: [80, 3000]. Tensor sizes: [80, 3392]

kavyamanohar · December 29, 2023, 9:48am

I have trained a fine-tuned whisper model for an ultra low resource Malasar language.

While doing inference to evaluate the performance on a test set using the code, It performs decoding for few entries in data_test and then produces the error given in the title.

from tqdm import tqdm
from transformers.pipelines.pt_utils import KeyDataset

all_predictions = []

# run streamed inference
for prediction in tqdm(
    pipe(
        KeyDataset(data_test, "audio_path"),
        max_new_tokens=1024,
        generate_kwargs={"task": "transcribe"},
        batch_size=8,
    ),
    total=len(data_test),
):
    all_predictions.append(prediction["text"])

What does that error indicate, and what could be the possible solutions?

VladS159 · May 13, 2024, 4:28pm

Did you find out what was the problem?

Topic		Replies	Views
Trainer RuntimeError: The size of tensor a (462) must match the size of tensor b (448) at non-singleton dimension 1 🤗Transformers	17	44725	May 23, 2024
The size of tensor error while fine tuning whisper Beginners	1	560	February 13, 2024
RuntimeError: The size of tensor a (553) must match the size of tensor b (448) at non-singleton dimension 1 Beginners	3	1087	July 17, 2024
RuntimeError: The expanded size of the tensor (31) must match the existing size (7) at non-singleton dimension 0. Target sizes: [31]. Tensor sizes: [7] Beginners	0	185	May 23, 2024
RuntimeError: The size of tensor a (4096) must match the size of tensor b (4097) at non-singleton dimension 3 Models	1	438	August 24, 2024

Whisper Inference RuntimeError: The expanded size of the tensor (3000) must match the existing size (3392) at non-singleton dimension 1. Target sizes: [80, 3000]. Tensor sizes: [80, 3392]

Related topics