Combine BertForSequenceClassificaion with Additional Features

See this response where I explain how to modify BERT to add additional POS (part-of-speech) features to tokens to perform named-entity recognition.