Using transformers (BERT, RoBERTa) without embedding layer

Something like this could be a good starting point for you:
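A minimal sketch, assuming the Hugging Face `transformers` library with PyTorch. The idea is to pass your own vectors through the `inputs_embeds` argument instead of `input_ids`, so the token-embedding lookup is bypassed. The small, randomly initialised config and the `features` tensor are placeholders I made up so the snippet runs without downloading anything; for real use you would probably load pretrained weights and make sure your vectors have the model's `hidden_size`.

```python
import torch
from transformers import BertConfig, BertModel

# Assumption: a tiny random config so this runs offline.
# For real use, something like BertModel.from_pretrained("bert-base-uncased").
config = BertConfig(
    hidden_size=64,
    num_hidden_layers=2,
    num_attention_heads=4,
    intermediate_size=128,
)
model = BertModel(config)
model.eval()  # disable dropout for deterministic inference

batch_size, seq_len = 2, 10

# Hypothetical pre-computed features, one vector per position.
# The last dimension must equal config.hidden_size.
features = torch.randn(batch_size, seq_len, config.hidden_size)
attention_mask = torch.ones(batch_size, seq_len, dtype=torch.long)

with torch.no_grad():
    # inputs_embeds replaces input_ids, so no vocabulary lookup happens.
    outputs = model(inputs_embeds=features, attention_mask=attention_mask)

last_hidden = outputs.last_hidden_state  # (batch_size, seq_len, hidden_size)
pooled = outputs.pooler_output           # (batch_size, hidden_size)
```

Note that with `inputs_embeds` only the word-embedding lookup is skipped: BERT still adds its position and token-type embeddings and applies LayerNorm to your vectors. If you want none of that, calling `model.encoder` directly on your hidden states should work, but then you have to prepare the attention mask yourself. `RobertaModel` accepts `inputs_embeds` in the same way.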