How to fine-tune on the CoLA dataset using Transformers and PyTorch?

I am trying to use RobertaForSequenceClassification as my backbone and torch.nn.parallel.DistributedDataParallel for data-parallel training.
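Concretely, the setup I have in mind looks roughly like this (a minimal sketch assuming the script is launched with torchrun, which sets LOCAL_RANK; "roberta-base" is just the checkpoint I happen to use):

  import os

  import torch
  import torch.distributed as dist
  from torch.nn.parallel import DistributedDataParallel as DDP
  from transformers import RobertaForSequenceClassification

  # launched with e.g.: torchrun --nproc_per_node=<num_gpus> train.py
  dist.init_process_group(backend="nccl")
  local_rank = int(os.environ["LOCAL_RANK"])
  torch.cuda.set_device(local_rank)

  # CoLA is binary acceptability classification: 0 = unacceptable, 1 = acceptable
  model = RobertaForSequenceClassification.from_pretrained(
      "roberta-base", num_labels=2
  ).cuda(local_rank)
  model = DDP(model, device_ids=[local_rank])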
My questions are as follows.

  1. Is the Matthews correlation metric used for both training and evaluation, or only for evaluation? Is the loss function for the CoLA dataset nn.CrossEntropyLoss or the Matthews correlation? (My current understanding is in the training-loop sketch at the end of this post.)

  2. What should I feed into the model? Is the code below OK? (A fuller data-preparation sketch is at the end of this post.)

  # keep only the tensors the model's forward() expects
  train_dataset.set_format(type='torch', columns=['input_ids', 'labels', 'attention_mask'])
  val_dataset.set_format(type='torch', columns=['input_ids', 'labels', 'attention_mask'])
  3. In the RobertaForSequenceClassification source code
    transformers/modeling_roberta.py at 198c335d219a5eb4d3f124fdd1ce1a9cd9f78a9b · huggingface/transformers · GitHub
    is the attention_mask that is passed to each layer_module in the RobertaEncoder loop the same at every layer?
If you could share some PyTorch & Hugging Face code that does not use the Hugging Face Trainer, that would be great! My rough attempts so far are below; corrections are very welcome.
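For question 2, here is the fuller data-preparation sketch I mentioned (assuming the GLUE CoLA split from the datasets library, where the text column is "sentence" and the label column is "label"; batch sizes and max_length are arbitrary choices on my part):

  from datasets import load_dataset
  from torch.utils.data import DataLoader
  from torch.utils.data.distributed import DistributedSampler
  from transformers import RobertaTokenizerFast

  tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base")
  raw = load_dataset("glue", "cola")

  def tokenize(batch):
      return tokenizer(batch["sentence"], truncation=True,
                       padding="max_length", max_length=128)

  train_dataset = raw["train"].map(tokenize, batched=True)
  val_dataset = raw["validation"].map(tokenize, batched=True)

  # the GLUE split stores labels under "label"; the model expects "labels"
  train_dataset = train_dataset.rename_column("label", "labels")
  val_dataset = val_dataset.rename_column("label", "labels")

  train_dataset.set_format(type='torch', columns=['input_ids', 'labels', 'attention_mask'])
  val_dataset.set_format(type='torch', columns=['input_ids', 'labels', 'attention_mask'])

  # DistributedSampler shards the training data across the DDP processes
  train_loader = DataLoader(train_dataset, batch_size=32,
                            sampler=DistributedSampler(train_dataset))
  val_loader = DataLoader(val_dataset, batch_size=64)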
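For question 1 and the closing request, here is my rough attempt at a training loop without the Trainer, continuing from the sketches above (epoch count and learning rate are arbitrary). My current understanding, which I would like confirmed, is that nn.CrossEntropyLoss drives the gradient updates (RobertaForSequenceClassification computes it internally when labels are passed), while Matthews correlation is only an evaluation metric, since it is not differentiable in its usual form:

  from sklearn.metrics import matthews_corrcoef
  from torch.optim import AdamW

  optimizer = AdamW(model.parameters(), lr=2e-5)
  device = torch.device("cuda", local_rank)

  for epoch in range(3):
      model.train()
      train_loader.sampler.set_epoch(epoch)  # reshuffle the shards each epoch
      for batch in train_loader:
          batch = {k: v.to(device) for k, v in batch.items()}
          outputs = model(**batch)  # .loss is CrossEntropy over the 2 classes
          outputs.loss.backward()
          optimizer.step()
          optimizer.zero_grad()

      # evaluation: Matthews correlation coefficient, the official CoLA metric
      model.eval()
      preds, golds = [], []
      with torch.no_grad():
          for batch in val_loader:
              batch = {k: v.to(device) for k, v in batch.items()}
              logits = model(**batch).logits
              preds.extend(logits.argmax(dim=-1).cpu().tolist())
              golds.extend(batch["labels"].cpu().tolist())
      print(f"epoch {epoch}: validation MCC = {matthews_corrcoef(golds, preds):.4f}")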