Run on single local file rather than dataset

HI, I am not in the audio recognition field. But I assume the framework is same as other tasks.

It’s better to use Huggingface Dataset format, but It’s not compulsory.
It’s very flexible to switch to Pytorch or TF format.

For example, in Language field, I first use pandas DataFrame.
Second, Convert to HF Dataset by using Dataset.from_pandas(DF)
After tokenizing, I convert it to pytorch format using DataLoader class.

Also I use native pytorch LLM model which is HF pre-trained model.

Just take some time for reading tutorials and docs, then you’ll find out. (It took more than a month for me since I am a slow leaner. :stuck_out_tongue_winking_eye:)
GL!