I am trying to get my head around how to use Ray Tune’s PB2 scheduler together with the Hugging Face Trainer. Specifically, how can you load the best/final model after a PBT-based hyperparameter search and then continue to test/predict?
With grid search, we could do something like this:
```python
from json import dump

# Search for the best hyperparameters
best_params = trainer.hyperparameter_search(...)

# Set the trainer to the best hyperparameters found
for hparam, v in best_params.hyperparameters.items():
    setattr(trainer.args, hparam, v)

# Save the optimal hparams (the BestRun object itself is not
# JSON-serializable, so dump its hyperparameters dict)
with output_dir.joinpath("opt_hparams.json").open("w", encoding="utf-8") as hp_out:
    dump(best_params.hyperparameters, hp_out, indent=4, sort_keys=True)

# Now train the model from scratch with the best hparams
train_result = trainer.train()

# ... and then get predictions from the model on the held-out test set
predictions = trainer.predict(test_dataset)
```
but because PB2 mutates its hyperparameters on the fly during training, this approach does not carry over: there is no single, fixed set of hyperparameters that you can simply re-train with from scratch.
So how do we typically go about this scenario? How do we load the final, best model after a PBT-based hyperparameter search, and then predict on the test set?
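For what it’s worth, conceptually what I am after is something like the toy sketch below: pick the checkpoint directory of the trial that ended with the best metric, then load the model from that checkpoint. The trial names, metric values, and checkpoint paths here are made up for illustration; with Ray I assume this information would come from the experiment’s analysis/results object rather than a hand-built dict.

```python
# Toy sketch of the selection step only (hypothetical trials and paths).
# Each PB2 trial ends with its own final checkpoint; since hyperparameters
# changed during training, the checkpoint weights are what matter, not a
# single hparam set.
trial_results = {
    "trial_00000": {"eval_f1": 0.81, "checkpoint": "ray_results/trial_00000/checkpoint_000009"},
    "trial_00001": {"eval_f1": 0.86, "checkpoint": "ray_results/trial_00001/checkpoint_000009"},
    "trial_00002": {"eval_f1": 0.79, "checkpoint": "ray_results/trial_00002/checkpoint_000009"},
}

# Pick the trial with the highest final eval metric ...
best_trial = max(trial_results, key=lambda t: trial_results[t]["eval_f1"])
# ... and take its checkpoint path; the model would then be loaded from here,
# e.g. with AutoModel.from_pretrained(best_checkpoint).
best_checkpoint = trial_results[best_trial]["checkpoint"]

print(best_trial, best_checkpoint)
```

Is this roughly the right mental model, and if so, what is the idiomatic way to get at the best trial’s checkpoint through the Trainer/Ray integration?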
PS I asked this on the Ray forums but did not get a response unfortunately.