Wav2vec2.0 memory issue

EmreOzkose · March 22, 2021, 9:11am

Since it takes too long to load data, It might be helpful to share normalized histogram of my dataset.

normalized number of sample list:
[‘0.1512’, ‘1.0000’, ‘0.8367’, ‘0.8265’, ‘0.6045’, ‘0.2867’, ‘0.1256’, ‘0.0611’, ‘0.0330’, ‘0.0189’, ‘0.0103’, ‘0.0057’, ‘0.0034’, ‘0.0017’, ‘0.0011’, ‘0.0006’, ‘0.0004’, ‘0.0002’, ‘0.0001’]

corresponding seconds:
[ 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19]

I have sounds which are more than 19sn. It can be a problem. I think padding is done in batch (default behavior), so each batch have different shape. First batch may have long duration. I might check a subset which is restricted to less than 6sn.

Topic		Replies	Views
German ASR: Fine-Tuning Wav2Vec2 Languages at Hugging Face	17	3686	February 18, 2022
Wav2vec2-xls-r-2b out of memory issues on A100 (40 GB) Models	0	685	May 13, 2022
How to finetune wav2vec2.0-xlsr model with long audio files Beginners	1	833	September 6, 2022
How much memory to fine tune wav2vec2? Models	2	1164	March 7, 2022
Wav2vec2 finetuning custom dataset 🤗Transformers	2	2465	December 25, 2024

Wav2vec2.0 memory issue

Related topics