For your side quest; Add a torch.set_default_device(device) prior to loading the model. The message is harmless anyway as you are moving the model to a device after creating it.
For additional tokens - I believe you have to train, finetune because adding tokens causes the model to lose some of its weights.