TypeError: cross_entropy_loss(): argument 'input' (position 1) must be Tensor, not SequenceClassifierOutput

xap · April 25, 2022, 9:04pm

I am doing sentence pair classification. I am using BertForSequence classification.

My model is as follows:

model = BertForSequenceClassification.from_pretrained(checkpoint, num_labels=5)

And my training loop looks like the below:


import numpy as np
EPOCHS = 5
criterion = nn.CrossEntropyLoss()

total_loss, total_accuracy = 0, 0

# empty list to save model predictions
total_preds=[]

for epoch in range(EPOCHS):
  model.train()
  total_train_loss = 0
  total_train_acc  = 0

  
  for step,batch in enumerate(train_dataloader):
    batch = [r.to(device) for r in batch]
    input_id,attention_mask,token_type_id,y = batch
    

    model.zero_grad()  
    
    prediction = model(input_id,attention_mask,token_type_id)

    loss = criterion(prediction,y)

    total_loss = total_loss + loss.item()



    loss.backward()

    torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)

    optimizer.step()

    
    preds=preds.detach().cpu().numpy()

   
    total_preds.append(preds)

  
    avg_loss = total_loss / len(train_dataloader)
  
  
    total_preds  = np.concatenate(total_preds, axis=0)

print(avg_loss)

When I train the model, I get the following error :

TypeError: cross_entropy_loss(): argument 'input' (position 1) must be Tensor, not SequenceClassifierOutput

I am not able to figure out what is wrong here. Any suggestions?

sgugger · April 26, 2022, 1:02am

Models of the Transformers library do not output tensors, see for example the quicktour. As such your prediction object is not a Tensor, as the error message tells you. prediction.logits is the tensor you are looking for, or you can pass along the labels to the model too to grab the loss in prediction.loss.

xap · April 26, 2022, 4:10pm

I am just a beginner on this so as per your suggestion is this what you suggest?

loss = criterion(prediction.logits,y)

Can you tell me how can I pass along the labels? I am confused about it.

Topic		Replies	Views
Class weights for bertForSequenceClassification Beginners	10	11993	May 29, 2022
Which loss function in bertforsequenceclassification regression Beginners	7	14852	February 25, 2021
Sentence Pair Classification Intermediate	1	1935	May 4, 2022
BertForSequenceClassification Index Error 🤗Transformers	1	2486	July 19, 2020
Inference result is SequenceClassifierOutput instance? 🤗Transformers	0	405	May 24, 2022

TypeError: cross_entropy_loss(): argument 'input' (position 1) must be Tensor, not SequenceClassifierOutput

Related topics