After some further research, it seems like the parameter comes from PretrainedConfig in configuration_utils.py
I am still not sure if this parameter is used during training or what effect it has.
After some further research, it seems like the parameter comes from PretrainedConfig in configuration_utils.py
I am still not sure if this parameter is used during training or what effect it has.