Hi, the documentation only explains how to add audio files, but I want to add audio files together with their transcriptions.
How can I do that so I can build a dataset of snippet/transcription pairs that I can train on?
Also, if I want two separate datasets, one for testing and one for training, what approach should I follow? Should I upload everything and tag each row in metadata.csv, or create two folders and upload the snippets and transcriptions into each?
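For the two-folder option, here is a minimal sketch of one possible layout, assuming the `audiofolder` loader convention from the `datasets` library: one sub-folder per split, each with a `metadata.csv` that has a `file_name` column plus a `transcription` column. The helper names `write_silent_wav` and `build_audiofolder`, the file names, and the silent placeholder audio are all hypothetical and only there to make the example runnable with the standard library.

```python
import csv
import wave
from pathlib import Path


def write_silent_wav(path: Path, num_frames: int = 16000, sample_rate: int = 16000) -> None:
    """Write a mono 16-bit WAV of silence (hypothetical stand-in for a real snippet)."""
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per frame = 16-bit PCM
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * num_frames)


def build_audiofolder(root: Path, splits: dict) -> None:
    """Create root/<split>/<file>.wav and root/<split>/metadata.csv for each split.

    `splits` maps a split name ("train", "test") to a list of
    (file_name, transcription) pairs.
    """
    for split, items in splits.items():
        split_dir = root / split
        split_dir.mkdir(parents=True, exist_ok=True)
        with open(split_dir / "metadata.csv", "w", newline="", encoding="utf-8") as f:
            writer = csv.writer(f)
            # `file_name` links each row to an audio file in the same folder;
            # any extra column (here `transcription`) is kept as a feature.
            writer.writerow(["file_name", "transcription"])
            for file_name, text in items:
                write_silent_wav(split_dir / file_name)
                writer.writerow([file_name, text])


# Hypothetical example data: two training snippets and one test snippet.
EXAMPLE_SPLITS = {
    "train": [("snippet_0001.wav", "hello world"), ("snippet_0002.wav", "good morning")],
    "test": [("snippet_0003.wav", "see you later")],
}

if __name__ == "__main__":
    build_audiofolder(Path("my_dataset"), EXAMPLE_SPLITS)
```

With the `datasets` library installed, `load_dataset("audiofolder", data_dir="my_dataset")` should then return `train` and `test` splits, each with an `audio` column and a `transcription` column, and the result can be uploaded with `push_to_hub`.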
@lhoestq Thanks.
I currently have a local dataset, and I created a Dataset instance to use it with trainer.train.
I use a batch size of 4. However, I see that my GPU memory is almost full. Does trainer.train load all the data into memory even though I have specified the batch size?
To give some idea of scale, my training set has 84K samples, each of shape (16000,), which is 1 second of audio. My eval set also has 84K samples.
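As a rough back-of-the-envelope check on those numbers, the sketch below compares the raw size of one batch with the raw size of the whole training set. It assumes each audio value is stored as float32 (4 bytes), which the post does not state. The constant names are made up for illustration.

```python
NUM_SAMPLES = 84_000      # training samples, from the post
SAMPLE_LENGTH = 16_000    # values per sample (1 second of audio), from the post
BATCH_SIZE = 4            # from the post
BYTES_PER_VALUE = 4       # assumption: float32 storage

# Raw input size of a single batch versus the full training set.
batch_bytes = BATCH_SIZE * SAMPLE_LENGTH * BYTES_PER_VALUE
dataset_bytes = NUM_SAMPLES * SAMPLE_LENGTH * BYTES_PER_VALUE

print(f"one batch:    {batch_bytes / 1e6:.3f} MB")
print(f"training set: {dataset_bytes / 1e9:.2f} GB")
```

Under that assumption, one batch of raw audio is about a quarter of a megabyte, while the full training set is over 5 GB. If only batches are moved to the GPU, the input data alone is unlikely to fill it. Model weights, gradients, optimizer state, and activations are the more likely consumers, though that depends on the model, which is not described here.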