Does HuggingFace finetune the RobertaForSeqClassification head for sentiment prediction? In other words, is the SeqClassification head (i.e. additional layers over BertModel) trained by HuggingFace?
I am asking this to know why the SeqClassification head does so good for sentiment task?