Just tried running the finetuning code plus some minor modifications on an EC2 instance with a V100 and it just wasn’t enough, even when reducing the batch size.
What are yall’s experiences when using the Wav2Vec2 big models? Especially the XLSR multilingual model?