CUDA out of memory when using Trainer with compute_metrics

What is the reason for only using the first element of logits and predictions?
