"too many values to unpack (expected 4)" but pixel_values dimension is correct

Here is what I see as the shape of pixel_values: