In trainer, how to get something from model?

shelf · September 15, 2022, 11:44am

I customize Trainer’s compute_loss. Within the 'compute_loss`, I want to get two kinds of things from the model. The first kind is a config value, such as vocab_size. It is always a fixed value. The other kind are parameters, such as embeddings.

I find two models:

self.model. I can get self.model.config.vocab_size and self.model.bert.embedding.word_embeddings.
compute_loss’s arg model. In case of multi-gpu, I can’t access model.config.vocab_size and model.bert.embedding.word_embeddings because model is wrapped by ddp.

So what’s difference between self.model and model? How to get something from my model? Does self.model have up-to-date parameters?

Topic		Replies	Views
Custom model for Trainer 🤗Transformers	1	382	July 8, 2023
Couple of questions about Trainer Beginners	0	329	June 13, 2023
Different loss values for trained and saved model Beginners	0	273	April 14, 2023
How do i get Training and Validation Loss during fine tuning 🤗Transformers	2	14696	August 27, 2021
Track multiple losses & different outputs size with Trainer and callbacks 🤗Transformers	4	3106	July 11, 2024

In trainer, how to get something from model?

Related topics