run_qa.py for TensorFlow does not save the model in the output directory

Andranik · February 17, 2022, 10:23am

I am very new to the topic so sorry if the question is dumb.

I am trying to train a Q/A model with the example Question Answering script for TensorFlow.
I use the following command to run the script:

python run_qa.py \
--model_name_or_path distilbert-base-cased \
--output_dir output \
--dataset_name squad \
--max_train_samples 40 \
--max_eval_samples 20 \
--do_train \
--do_eval

Everything seems to run fine and without errors, but after the script finishes training I cannot find the trained model in the output directory. The only files I see there are eval_nbest_predictions.json and eval_predictions.json.
So what am I doing wrong? How can I make the script save the trained model in the output directory?

raok95 · May 29, 2022, 11:01pm

@Andranik I am facing the same outcome. Were you able to identify the root cause?

merve · June 3, 2022, 12:50pm

Hello there

I realized what was wrong, and you can open a PR to fix it as well.
The saving is done by this callback. The callback should’ve been instantiated and passed to model.fit(), but it isn’t. I’ll open a PR to fix that (but you can also use this quick fix).
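
For readers who cannot follow the links, here is a minimal sketch of the idea and not necessarily the exact code in the script or the PR. It assumes the model is a Hugging Face TensorFlow model that exposes save_pretrained(), and that the callback has to be listed in the callbacks argument of model.fit() for Keras to invoke it. The helper fit_and_save and its parameter names are hypothetical and exist only for illustration.

```python
import tensorflow as tf


class SavePretrainedCallback(tf.keras.callbacks.Callback):
    """Save the attached model with save_pretrained() after every epoch."""

    def __init__(self, output_dir):
        super().__init__()
        self.output_dir = output_dir

    def on_epoch_end(self, epoch, logs=None):
        # Keras attaches the model to the callback when it is passed to fit().
        # save_pretrained() is assumed to exist on the model (true for
        # Hugging Face TF models, not for plain Keras models).
        self.model.save_pretrained(self.output_dir)


def fit_and_save(model, train_dataset, output_dir, epochs=1):
    # Hypothetical helper: the important part is that the callback is
    # instantiated and listed in `callbacks`, otherwise it never runs.
    callback = SavePretrainedCallback(output_dir)
    return model.fit(train_dataset, epochs=epochs, callbacks=[callback])
```

If editing the script is not an option, calling model.save_pretrained("output") once after model.fit() returns should have the same effect for the final weights.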
