It is a normal phenomenon: 'encoder.embed_tokens.weight'
will be initialized randomly (using self.shared
) instead of pre-trained weights.
This warning was ignored in the newest version of the transformer
.
And of course, it is harmless; you can feel free to ignore this warning.
Please refer to this comment for more details.