How can I use HF’s BERT models for speech-to-text training?
BERT expects tokenized inputs: natural-language text that has been encoded (tokenized) into sequences of integer token IDs. To use BERT for speech, you would need to convert your audio into similar discrete tokens.
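To make the idea concrete, here is a minimal sketch of what tokenization produces. The vocabulary below is a toy stand-in invented for illustration, not BERT's real vocabulary, but the shape of the output (special tokens wrapping integer IDs) matches what a real BERT tokenizer returns.

```python
# Toy illustration of tokenization: text -> integer token IDs.
# The vocabulary and IDs here are made up for the example; a real BERT
# tokenizer uses a fixed ~30k-entry WordPiece vocabulary.
toy_vocab = {"[CLS]": 101, "[SEP]": 102, "hello": 7592, "world": 2088}

def toy_tokenize(text):
    """Map whitespace-split words to IDs, wrapped in [CLS]/[SEP] markers."""
    ids = [toy_vocab["[CLS]"]]
    ids += [toy_vocab[word] for word in text.lower().split()]
    ids.append(toy_vocab["[SEP]"])
    return ids

print(toy_tokenize("Hello world"))  # [101, 7592, 2088, 102]
```

The challenge for speech is that audio is a continuous waveform, so there is no obvious vocabulary of discrete units to map it into, which is why speech models typically handle this step differently than text models.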
If you want to use a pre-trained BERT model, you would need to use exactly the same vocabulary of tokens it was trained with. If you want to train a BERT model from scratch, you could define your own tokens.
To learn more about tokenization, see the BERT Word Embeddings Tutorial by Chris McCormick.