Load/save HF block sparse model

vblagoje · October 20, 2020, 9:07am

Hey everyone,

I am exploring https://github.com/huggingface/pytorch_block_sparse project. One of the issues that popped up almost immediatelly is loading a saved “sparsified” model. So, let’s say you sparsified Roberta using an example provided . Now that the model has been sparsified (it’s linear layers replaced with BlockSparseLinear nn modules) how can I load previously saved model using HF ecosystem? All I can think of is that I again need to create a Roberta model with uninitialized weights, sparsify it, and the load weights with model.load_state_dict(torch.load(PATH))?

Am I overlooking something obvious?

vblagoje · October 21, 2020, 1:47pm

No mechanism in place for loading as of now, which is ok. I sparsed the model again and loaded the weights manually via model.load_state_dict(torch.load(PATH)).

Topic		Replies	Views
Can't load weights for 'hfl/chinese-roberta-wwm-ext-large'. Beginners	2	867	July 22, 2020
Load quantized model in memory Beginners	1	587	December 8, 2023
How to save and load fine-tune model 🤗Transformers	4	24691	October 25, 2021
Saving/Loading custom model build from varying HF models Intermediate	1	1350	March 20, 2023
Different embeddings when load model from_tf and save to torch 🤗Transformers	0	379	February 28, 2023

Load/save HF block sparse model

Related topics