System info are:
I just run the official example scripts on transformers/examples/pytorch/language-modeling at eca77f4719531ecaabe9ec6b2dee6075a391d98a · huggingface/transformers · GitHub about bert mlm train.
and I get a warning info like:
python3.8/site-packages/torch/nn/parallel/_functions.py:68: UserWarning: Was asked to gather along dimension 0, but all input tensors were scalars; will instead unsqueeze and return a vector.
I want to know is this a normal situation? Thanks