I was playing around with Wav2Vec2 a bit more and noticed a few more things that seem off. First, in the Wav2Vec2Processor, whenever we pad, we apply the normalization afterwards, which in turn changes the padded value for each sequence. Would it not make more sense to apply the normalization before the padding, so that the padded value stays as is?

Also, some parameters like padded_value and do_normalize do not seem to work either. I tried setting do_normalize = False, but it still normalized. Likewise with padded_value: I tried passing a few different values, but the output from the processor did not change at all. I think this is because we do not pass the padded_value to the pad function.
Shouldn’t we pass the other parameters here as well?
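To make the ordering question concrete, here is a minimal NumPy sketch of what I mean. It is not the library's actual code: the helpers zero_mean_unit_var and pad_to, the pad_value argument, and the toy sequences are all made up for illustration, and I am assuming the normalization is a per-sequence zero-mean, unit-variance scaling.

```python
import numpy as np

def zero_mean_unit_var(x, eps=1e-7):
    # Hypothetical helper: scale a 1-D array to zero mean and unit variance.
    return (x - x.mean()) / np.sqrt(x.var() + eps)

def pad_to(x, length, pad_value=0.0):
    # Hypothetical helper: right-pad a 1-D array with pad_value up to `length`.
    return np.pad(x, (0, length - len(x)), constant_values=pad_value)

short_seq = np.array([1.0, 2.0, 3.0])
long_seq = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
max_len = max(len(short_seq), len(long_seq))

# Order A: pad first, then normalize over the whole padded sequence.
# The padding takes part in the mean/variance and gets rescaled itself.
pad_then_norm = zero_mean_unit_var(pad_to(short_seq, max_len))

# Order B: normalize only the real samples, then pad.
# The padded positions keep exactly the value that was asked for.
norm_then_pad = pad_to(zero_mean_unit_var(short_seq), max_len)

print(pad_then_norm[len(short_seq):])  # padded positions are no longer pad_value
print(norm_then_pad[len(short_seq):])  # padded positions are still pad_value
```

With order A the value in the padded positions depends on the content and length of each sequence, which is the behaviour I am seeing; with order B it stays fixed.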
Let me know where I am wrong. I am not very experienced with ML/DL models yet, so all of this might just be my misunderstanding.