Speech2TextModel does not support small d_model

AndrewZN00 · April 5, 2024, 5:54pm

I am trying to use Speech2TextModel with a very small d_model configuration (i.e., 2) but it raises the ZeroDivisionError error. Here is the simplified program to reproduce

from transformers import Speech2TextModel, Speech2TextConfig
import torch
with torch.inference_mode():
    cfg = Speech2TextConfig(d_model=2)
    model = Speech2TextModel(cfg)

Error:

emb = math.log(10000) / (half_dim - 1)
ZeroDivisionError: float division by zero

Is this the expected behavior? How can I use the speech2text model with d_model=2?

Topic		Replies	Views
RuntimeError: grad can be implicitly created only for scalar outputs 🤗Transformers	0	1053	August 10, 2023
Unable to find Speech2Text model Models	0	227	March 5, 2021
Model generating incorrect prediction Models	1	500	September 21, 2022
Is there a complete Speech2Text example? 🤗Transformers	0	581	November 24, 2021
ERROR: Could not find a version that satisfies the requirement torch==1.7.1+cpu Beginners	17	25176	December 15, 2020

Speech2TextModel does not support small d_model

Related topics