Hello. How can I create a model object and skip the random initialization of weights? The random initialization is time consuming and unnecessary for my case, as I want to load the weights using torch.load_state_dict
. For instance, see the code below.
config = BloomConfig.from_pretrained("bigscience/bloom")
block = BloomBlock(config) # initializes weights randomly, which is time consuming
block.load_state_dict(torch.load("path_to_pytorch_bin"))