I am using the off-the-shelf HubertForCTC model with just one change, `vocab_size=60`, and it is not working (it predicts every token as the pad token), while it works with `vocab_size=32`. What could be the solution?
```python
import torch
import torch.nn as nn
from transformers import HubertForCTC

class ASR(nn.Module):
    def __init__(self, vocab_size=32, ignore_mismatched_sizes=True, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # ignore_mismatched_sizes lets from_pretrained drop the checkpoint's
        # 32-class CTC head and randomly initialise a new one of the given size.
        self.hubert_ctc = HubertForCTC.from_pretrained(
            "facebook/hubert-large-ls960-ft",
            vocab_size=vocab_size,
            ignore_mismatched_sizes=ignore_mismatched_sizes,
        )

    def forward(self, input_values, attention_mask=None, labels=None, **kwargs):
        if attention_mask is None:
            attention_mask = torch.ones_like(input_values)
        out = self.hubert_ctc(
            input_values,
            attention_mask=attention_mask,
            labels=labels,
            return_dict=True,
        )
        return out.loss, out.logits
```
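For reference, here is a minimal sanity check (my own addition, not part of the training code) to confirm the head really was resized; `lm_head` is the CTC output layer inside `HubertForCTC`:

```python
model = ASR(vocab_size=60)

# The CTC head should now project to 60 classes
# (in_features is 1024 for the large checkpoint).
print(model.hubert_ctc.lm_head)
print(model.hubert_ctc.config.vocab_size)   # expect 60

# The pad token doubles as the CTC blank, so this id must match
# the tokenizer that produced the labels.
print(model.hubert_ctc.config.pad_token_id)
```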
I have checked the logits:

```python
predicted_ids = torch.argmax(logits, dim=-1)
dec = self.processor.batch_decode(predicted_ids, skip_special_tokens=True)
```
All the predicted IDs come out as zero (the pad ID), but the same setup works perfectly fine with `vocab_size=32`.
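To rule out a decoding artifact, I also looked at the raw probabilities instead of just the argmax. A quick sketch of that check (assuming `logits` and `labels` come from the forward pass above):

```python
probs = torch.softmax(logits, dim=-1)   # (batch, frames, vocab)

# Average probability mass on id 0, the pad/blank token.
print(probs[..., 0].mean())

# The five classes the model favours overall.
print(probs.mean(dim=(0, 1)).topk(5))

# Labels outside [0, vocab_size) would silently corrupt the CTC loss,
# so the label range is worth verifying too.
print(labels.min(), labels.max())
```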