You’ll still need to pass labels
for training.
Training will be same as training any GPT-2 model, only difference is the attention_mask
You’ll still need to pass labels
for training.
Training will be same as training any GPT-2 model, only difference is the attention_mask