Distilbert Seq2clas

natank · June 25, 2021, 1:46am

Hello

I have two questions:
1 We can view the. pooled layer by using output_hidden_states=true and follow the logic, but is there a generic way to do so?

2 When we are doing inference, can we output the hidden_states from trainer.predict() as we do by model(input/output_hidden_states=True)?

Thanks

pratikbhatia0011 · July 18, 2021, 10:59am

I have a similar doubt as regards point 2.

I am working on Question Answering with Distilbert.
The predict function in the Trainer does not work if output_hidden_states = True. It works fine if the same argument is set to False.
Is this a bug? If not, then how is one to use a model for prediction if one has set the argument output_hidden_states = True while initializing the model ?

pratikbhatia0011 · July 18, 2021, 12:01pm

Found the solution. Posting it here just in case someone else too gets stuck with my particular problem.

I just had to pass ignore_keys = [‘attentions’] in the predict function and everything works fine.

pratikbhatia0011 · July 18, 2021, 12:22pm

@natank To answer your second question, I don’t think so because as seen in the predict function source code, it only returns predictions, label_ids and metrics.

natank · July 19, 2021, 11:53am

Thanks for you replies. Regarding one it indeed resolved me a problem. Regarding your answer on q2. It seems that Huggingsface offer two working modes, research : use model , and model taining - use predict. Am I right?

Topic		Replies	Views
How to use encoded hidden_states as input to a Bert/DistilBert Model Beginners	0	334	June 19, 2023
Pool [CLS] token from DistilBERT 🤗Transformers	1	790	January 18, 2022
Model did not return a loss --- but why? 🤗Transformers	0	744	April 27, 2023
DistilBERT multiclass classification example 🤗Transformers	0	288	May 22, 2023
How to yield hidden_states from a saved, fine-tuned (distil)bert model? 🤗Transformers	2	401	July 12, 2020

Distilbert Seq2clas

Related topics