Fine tune BERT for NER task

Have any of you ever encountered this problem? It is not always happen, but if the sample size is over 1000, the error pops out.