I’m trying to replicate the DSI paper released by google research. In this paper, they extend the output vocabulary to support directly ouput docids. But I don’t know how they implement this. Is there any ways to add new tokens in the vocabulary of embedding layer in T5? Is this function supported by some API or do I need to change the source code?