Can't load GLUE/MNLI dataset from Hub due to schema issues

Hello, I’m trying to load the MNLI dataset from GLUE on Databricks using datasets 2.13.2, and I’m hitting the error shown in the image below. Since it’s a standard, widely used dataset, I expected it to work out of the box. Is there any special config I should provide?

Thanks in advance, Felipe

Hi! You can avoid this error by updating datasets (pip install -U datasets) to a newer version and then downloading the dataset.

Hi Mario, thanks for the quick reply. I updated, but I’m still getting the error below:

Found the problem: pyspark was missing from the environment.

With pyspark 3.5.1 and datasets 2.17.1 everything is working now.
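For anyone landing here later, here is a minimal sketch of how one might check the environment before loading the dataset. The helper names (`installed_version`, `check_environment`, `load_mnli`) are hypothetical and only for illustration; the only library call assumed is `datasets.load_dataset("glue", "mnli")`, which needs network access.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def check_environment(packages=("datasets", "pyspark")):
    """Map each package name to its installed version (None when missing)."""
    return {name: installed_version(name) for name in packages}


def load_mnli():
    """Load GLUE/MNLI from the Hub (hypothetical wrapper; requires network access)."""
    # Imported lazily so the version check above still runs without datasets installed.
    from datasets import load_dataset

    return load_dataset("glue", "mnli")
```

Calling `check_environment()` first shows whether either package is missing (its value will be `None`). The combination reported as working in this thread was pyspark 3.5.1 with datasets 2.17.1; other combinations may behave differently.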

Thanks for the help.
