Edited this second post because I had confused myself, lol. Are all models subject to different methodologies for determining the minimum threshold for block_sparse?