Hi, my LightningModule contains an attribute `x` which I ignore in `save_hyperparameters`. As expected, when I try using `LightningModule.load_from_checkpoint`, it asks me to pass `x` as a kwarg. However, when I use the same checkpoint path as the value for `resume_from_checkpoint` in a `Trainer`, I am not asked for `x`, and everything appears to work just fine; it seems to "remember" `x` from somewhere else. Not to look a gift horse in the mouth, but I'd like to know: why do these two options behave differently? Thanks!
Hi @mikemc
This is because `load_from_checkpoint` re-instantiates your model with the hyperparameters stored in the checkpoint: it creates a new instance by calling the constructor. `Trainer(resume_from_checkpoint=...)` only loads the weights into an instance that already exists. By the way, in newer versions it is `Trainer.fit(ckpt_path=...)`.
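For illustration, here is a minimal sketch of the `load_from_checkpoint` path. The class name, `hidden_dim`, and the checkpoint path are placeholders (and on older releases the import is `pytorch_lightning` instead of `lightning.pytorch`):

```python
import lightning.pytorch as pl  # `import pytorch_lightning as pl` on older versions
from torch import nn

class LitModel(pl.LightningModule):
    def __init__(self, hidden_dim: int, x):
        super().__init__()
        self.save_hyperparameters(ignore=["x"])  # x is NOT stored in the checkpoint
        self.x = x
        self.layer = nn.Linear(hidden_dim, 1)

# hidden_dim is read back from the checkpoint's hyperparameters,
# but x was ignored, so it has to be passed in again as a kwarg:
model = LitModel.load_from_checkpoint("path/to/checkpoint.ckpt", x=...)
```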
At the time the loading is executed, the model is already instantiated:
```python
model = ...                        # this is the instance; you constructed it, so x is already set
trainer.fit(model, ckpt_path=...)  # load weights from the checkpoint and resume training
```
> I'd like to know: why do these two options behave differently?
So in summary, they behave differently because they are not meant for the same use case:
- The mechanism on the `Trainer` is for resuming training.
- The loading mechanism on the `LightningModule` is a convenience for loading a model only (for example, for testing or inference).