Hi @Borell, can you provide a reproducer. My hunch is that you are not initializing the correct model architecture under init_empty_weights
.
Hi @Borell, can you provide a reproducer. My hunch is that you are not initializing the correct model architecture under init_empty_weights
.