Pruning a model embedding matrix for memory efficiency

Yes this seems like the right approach.
When you get to step 4/5 you can just make a new Tokenizer.
If you get it working please post the solution here!

1 Like