Additional features in token classification

Hi,

I would like to use additional input features in the token classification task (BIO Tags) using XLM-roberta.
These features are:

  1. relative word frequency in analyzed document (one number for a word)
  2. vector representating the entire document (vector of size 768)

What is the best way to do this?