Hi @aswincandra the state_dict
is just a Python dict
that maps each layer to its corresponding tensors: What is a state_dict in PyTorch — PyTorch Tutorials 1.7.1 documentation
The reason I mentioned it is because I think your error is coming from this line of the from_pretrained
function: transformers/modeling_utils.py at 748006c0b35d64cdee23a3cdc2107a1ce64044b5 · huggingface/transformers · GitHub
Right now you can’t see the lower-level error message from PyTorch, so trying to load it directly might shed some light on what the problem is