How to set vocabulary size?

hadiqa123 · September 19, 2022, 5:59pm

Hello everyone.

I am having an issue with the vocabulary size. as shown in the screenshot below. Can anyone help me why is this happening?

The following is my line of code:

from transformers import Wav2Vec2ProcessorWithLM
processor = Wav2Vec2ProcessorWithLM.from_pretrained(“BakhtUllah123/xlsr_ur_training”)
logits.shape
“.”.join(sorted(processor.tokenizer.get_vocab()))
transcription = processor.batch_decode (logits.numpy()).text

Topic		Replies	Views
Vocabulary count mismatch when loading the previously created tokenizer 🤗Transformers	0	168	January 8, 2024
Wav2Vec2 ASR Fine tuneing Improvement Beginners	0	174	November 7, 2023
Improving performance of Wav2Vec2 fine tuning with word piece vocabulary Research	5	2994	October 27, 2021
Vocab_size value for facebook/w2v-bert-2.0 Models	0	253	November 13, 2024
Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers article bug Beginners	15	2734	March 7, 2024

How to set vocabulary size?

Related topics