Sst2 dataset labels look worng

cjgs · October 19, 2021, 9:31am

Hello all,

I feel like this is a stupid question but I cant figure it out

I was looking at the GLUE SST2 dataset through the huggingface datasets viewer and all the labels for the test set are all -1.

They are 0 and 1 for the training and validation set but all -1 for the test set.

Shouldn’t the test labels match the training labels? What am I missing?

nielsr · October 19, 2021, 11:11am

GLUE is a benchmark, so the true labels are hidden, and only known by its creators.

One can submit a script to the official website, which is then run on the test set. In that way, one can create a leaderboard with the best performing algorithms.

cjgs · October 19, 2021, 11:04pm

Thank you!

Topic		Replies	Views
Nlp course: Why I Fine-tune a model on the GLUE SST-2 dataset but get worse score compare to Bert(base) Course	0	534	July 27, 2023
AttributeError: 'TrainOutput' object has no attribute 'metrics' when finetune custom dataset 🤗Transformers	3	2512	January 4, 2021
I am getting bad performance when evaluating on Huggingface test dataset (GLUE dataset) 🤗Transformers	0	291	October 26, 2021
Sequence Classification -- Fine Tune? Beginners	3	3136	January 31, 2021
How to label dataset for Causal Language Modeling Beginners	0	521	January 27, 2023

Sst2 dataset labels look worng

Related topics