Is there a way to explicitly tell BERT to pay more attention to certain tokens given some prior information I have about the data?
Is there a way to explicitly tell BERT to pay more attention to certain tokens given some prior information I have about the data?