ASTFeatureExtractor

Hi,

I’m working in a Master’s Dissertation to predict music popularity using AST model.

I’m looking now at the ASTFeatureExtractor here: Audio Spectrogram Transformer that converts audio raw files to Mel spectrograms.

Looks like ‘max_length’ parameter of ASTFeatureExtractor default value is 1024. To me, 1024 means that only the first 10.24 seconds of each song will be inserted to the model. Anyone can confirm that?

Regards

1 Like