Does anyone know why the raw version of the huggingface go_emotions dataset has 211k rows when there should only be 58k rows as per the simplified version?
1 Like
The simplified version seems to have been filtered based on reter-agreement according to the home page here: https://github.com/google-research/google-research/tree/master/goemotions
1 Like