Error Training Vision Encoder Decoder for Image Captioning

I checked out my vocab size and length of tokenizer, they are different

Vocab size - 50257
Tokenizer length - 50258

Is that causes index error?

1 Like