If I use trainer.train() and then save the model, is that still useful?

I’m wondering: if I run training with the Trainer (rather than with model.train()) and then save the model, is the saved model still useful on its own, or do I need to save the Trainer as well?

And then, how do I load it afterwards: as a Trainer or as a model?


Hi,

When model.train() has completed and the model has been saved with torch.save(model, 'model.pth'), I load it back with torch.load('model.pth').

If I load the model with from_pretrained before using the Trainer, then after fine-tuning I still use from_pretrained to load the fine-tuned model from the saved checkpoint.

You can test with a small model to check whether both loading approaches work.
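A minimal sketch of both paths; t5-small stands in for whatever model you fine-tuned, and the checkpoint path is just an example of the directories the Trainer writes under its output_dir:

import torch
from transformers import AutoModelForSeq2SeqLM

# stand-in for the fine-tuned model; in practice this is whatever you just trained
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

# Option 1: pickle the whole model object
torch.save(model, "model.pth")
model = torch.load("model.pth")  # recent PyTorch versions may need weights_only=False

# Option 2: reload a saved checkpoint directory with from_pretrained
# (the path is an example; the Trainer writes checkpoints under its output_dir)
model = AutoModelForSeq2SeqLM.from_pretrained("output_dir/checkpoint-500")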

Do I need to use torch.save()? I have no issues, I’m just asking: I tried model.save_pretrained() (which didn’t seem to be available for my T5 model) and then model.save(), but nothing happened.

Right now I’m training with trainer.train(), and I guess I can then save the resulting model?

What I do is I load a pretrained one like so:

model = AutoModelForSeq2SeqLM.from_pretrained("t5-large")

Then I use the trainer like so:

trainer = Seq2SeqTrainer(
    model=model,
    args=training_args,
    train_dataset=train_tokenized_books,
    eval_dataset=eval_tokenized_books,
    tokenizer=tokenizer,
    data_collator=data_collator,
)

trainer.train()

Can I afterwards do torch.save(model, 'model.pth') like you mention, or did I screw up somehow?

There is a trainer.save_model method that will save it for you in a format that is compatible with from_pretrained.
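Something like this, continuing from the Seq2SeqTrainer defined in your snippet; the output directory name is just an example:

# after trainer.train() has finished
trainer.save_model("finetuned-t5-large")  # writes config, weights, and the tokenizer (since it was passed to the Trainer)

# later, reload it exactly like any pretrained checkpoint
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model = AutoModelForSeq2SeqLM.from_pretrained("finetuned-t5-large")
tokenizer = AutoTokenizer.from_pretrained("finetuned-t5-large")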


ooh ok! Thank you!

I have another question, if I may: what is the difference between training with the Trainer versus with the model directly, i.e. trainer.train() vs model.train()? Is there any practical difference? I’ve been looking at the docs but can’t quite figure it out.
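For anyone finding this later: model.train() is the plain PyTorch call that only switches the module into training mode (enabling dropout and similar layers); it does not train anything by itself. trainer.train() runs the whole fine-tuning loop for you (batching, optimizer steps, evaluation, checkpointing, logging). A rough sketch of the manual loop the Trainer replaces, reusing the names from the snippet above; the optimizer, learning rate, and batch size here are arbitrary choices:

import torch
from torch.utils.data import DataLoader

# Manual PyTorch loop: model.train() only flips the training-mode flag,
# the actual optimization still has to be written by hand.
# Unlike the Trainer, this assumes the dataset contains only tensor columns
# (raw text columns would have to be removed first).
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
loader = DataLoader(train_tokenized_books, batch_size=8, collate_fn=data_collator)

model.train()  # enable dropout etc.; nothing is trained by this call alone
for batch in loader:
    outputs = model(**batch)  # the loss is returned because the batch contains labels
    outputs.loss.backward()
    optimizer.step()
    optimizer.zero_grad()

# trainer.train() wraps all of the above, plus evaluation, checkpointing, logging, ...
trainer.train()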