Summaries from model() and model_before_tuning_1() are different, but when I compare the model configs and/or print(model), both give exactly the same output.
How can I find out which parameters the training actually changed?
When fine-tuning a Transformer-based model such as BART, all parameters of the model are updated by default (unless you explicitly freeze some of them). This means any tensor in model.parameters() can have different values after fine-tuning.
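To see which parameters actually changed, you can snapshot the weights before training and compare them name by name afterwards. A minimal sketch of that comparison, using plain Python dicts of floats as stand-ins for the tensors you would get from `copy.deepcopy(model.state_dict())` before training and `model.state_dict()` after:

```python
# Sketch: find parameters whose values differ between two snapshots.
# With a real PyTorch model you would compare tensors instead, e.g.
# via torch.equal(old, new) or (old - new).abs().max() per entry.

def changed_parameters(before, after, tol=1e-12):
    """Return the names of parameters that differ between snapshots."""
    changed = []
    for name, old in before.items():
        new = after[name]
        # Any elementwise difference above tol counts as "updated".
        if any(abs(a - b) > tol for a, b in zip(old, new)):
            changed.append(name)
    return changed

# Toy snapshots standing in for state_dict() before/after fine-tuning.
before = {"encoder.weight": [0.1, 0.2], "decoder.weight": [0.3, 0.4]}
after  = {"encoder.weight": [0.1, 0.2], "decoder.weight": [0.35, 0.4]}

print(changed_parameters(before, after))  # → ['decoder.weight']
```

With full fine-tuning you should expect this list to contain essentially every parameter name, since the optimizer updates all of them.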
That is why the configuration of a model (config.json) is identical before and after fine-tuning: the configuration only defines architectural hyperparameters such as the number of hidden layers, the number of attention heads, and so on. It does not store the weights, so print(model) and the config cannot reveal what training changed.