Error io.BufferReader

It looks like some samples can’t be read (maybe corrupted files ?) when you access them using your for loop.

You can disable Audio decoding to iterate on your dataset without reading the audio data and to avoid this error:

from datasets import Audio

dataset = dataset.cast_column("audio", Audio(decode=False))
for example in dataset:
    ...