I’m just running some initial tests/experiments in train_unconditional.py
, to get a sense of how my data/images are going to work with Diffusers and I’d like to continue training from a saved checkpoint. I modified the script to use:
if args.overwrite_output_dir:
model = UNet2DModel(
// original model code
)
else:
model_dir = args.output_dir + '/unet/'
model = UNet2DModel.from_pretrained(model_dir)
which does appear to successfully load my checkpoint (based on the initial loss). However, when I run it, I see that the loss is clearly diverging. I’m guessing I don’t have the correct scheduler/optimizer settings, but I’m not sure how to load those.
Any tips appreciated.