Using External Datasets with HuggingFace Data Loader

ViT expects 3 input channels by default.

1 Like