Get warning "Could not estimate the number of tokens of the input, floating-point operations will not be computed" when use a customize Trainer and customize data collator

mzyMMMM · May 30, 2022, 9:01am

I want to use triplet loss to fine tune my Bert model, so I define a customize Trainer like that:

class TripletLossTrainer(Trainer):
        def compute_loss(self, model, inputs, return_outputs=False):
            anchor_data, positive_data, negative_data = inputs.values()
            anchor_output = self.use_avg_2(anchor_data, model)
            positive_output = self.use_avg_2(positive_data, model)
            negative_output = self.use_avg_2(negative_data, model)
            triplet_loss = nn.TripletMarginWithDistanceLoss(distance_function=lambda x, y: 1.0 - 
            F.cosine_similarity(x, y),margin=0.5)
            # compute custom loss
            loss = triplet_loss(anchor_output, positive_output, negative_output)
            return loss

where anchor_data, positive_data, negative_data are tokenized result with type <class ‘transformers.tokenization_utils_base.BatchEncoding’>, and the use _avg_2 function is for computing the average value for the last hidden state for each data with a Bert model.
When I do the training process, it will keep reporting warning like
Could not estimate the number of tokens of the input, floating-point operations will not be computed
Could not estimate the number of tokens of the input, floating-point operations will not be computed
Could not estimate the number of tokens of the input, floating-point operations will not be computed

I wonder what does the floating-point operations mean? will it influence the performance of my training? Thanks!

sgugger · May 31, 2022, 7:04pm

You can completely ignore that warning, it’s just to compute the total FLOPS done during your training.

mzyMMMM · June 1, 2022, 1:17am

Thanks a lot! Is there any method to stop outputting these warn information during the training?

sgugger · June 1, 2022, 1:54pm

No, but in the latest version you should see it only once.

Reza0247 · September 30, 2022, 7:24pm

Hi, I have the same problem when using the framework of huggingface to compute a contrastive loss. In my case, this warning information is repeated in the console which really makes me a big headache.

The version of my huggingface is 4.17.0. So how can I stop this information?

jxm · November 30, 2022, 9:50pm

Me too, on 4.23.1:

Topic		Replies	Views
Implementing Triplet loss in Vit 🤗Transformers	1	29	June 3, 2025
Bug in gradient accumulation training_step in huggingface Trainer? 🤗Transformers	3	811	November 2, 2024
Couple of questions about Trainer Beginners	0	329	June 13, 2023
How do I backpropagate specific output tokens using Trainer? Intermediate	0	37	December 25, 2024
Multi-GPU support lost when overwriting functions for Custom Trainer Intermediate	1	648	March 5, 2023

Get warning "Could not estimate the number of tokens of the input, floating-point operations will not be computed" when use a customize Trainer and customize data collator

Related topics