Run on single local file rather than dataset

Peterzoura · January 30, 2024, 3:19am

Hello, I am studying in Uni and trying to learn how to use huggingface libraries.

I wanted to do a simple test of recording myself and running an Automatic Speech Recognition pre-trained model but it looks like hugging face libraries only support using datasets. Is there any way to do this? Will I have to put my audio recording into a dataset format? How can I accomplish my goal?

Thank you.

Hakase-Noonna · January 30, 2024, 6:42am

HI, I am not in the audio recognition field. But I assume the framework is same as other tasks.

It’s better to use Huggingface Dataset format, but It’s not compulsory.
It’s very flexible to switch to Pytorch or TF format.

For example, in Language field, I first use pandas DataFrame.
Second, Convert to HF Dataset by using Dataset.from_pandas(DF)
After tokenizing, I convert it to pytorch format using DataLoader class.

Also I use native pytorch LLM model which is HF pre-trained model.

Just take some time for reading tutorials and docs, then you’ll find out. (It took more than a month for me since I am a slow leaner. )
GL!

Topic		Replies	Views
How to embed Hugging Face Pre-trained models in our own app Beginners	2	898	March 26, 2021
Loading custom audio dataset and fine-tuning model Beginners	6	3238	December 12, 2023
Audio dataset without uploading the data to the hub 🤗Datasets	6	1957	March 20, 2023
How to create a dataset for "audio-like" files for ASR Beginners	0	402	April 10, 2023
How to do that trained huggingface model speech recognation? DeepSpeed	0	402	December 10, 2021

Run on single local file rather than dataset

Related topics