Run multiple validation loops with different weights

AShedko · July 20, 2023, 9:40am

I would like to compare the performance of my model with different variations of EMA(exponential moving average of weights) and regular weights. With pre-2.0 lightning it was possible to extend the TrainingEpochLoop to achieve this.
Are there any options for this besides writing a custom trainer?

What I’ve come up with on my own:

Copying the model and trainer inside a callback and running cloned_trainer.validate(model_with_ema_weights, val_dataloader) - Requires one to reconnect the loggers and other experiment tracking features to the cloned trainer. Also wastes memory (briefly requires 2 copies of the weights to be loaded)
Using val_check_interval=1.0/num_validations and using on_validation_start to modify the weights in-place - The validation runs are not directly comparable because the model has seen
1.0-1/num_valuations more training data on one validation than on another.

Running multiple validations at the end of the training is an option but is it possible to have that information during the run?

seastar105 · March 13, 2024, 11:48am

Have you get nice solution for problem? i have same problem, wanna run validation loop on both ema weight, and training weight

Topic		Replies	Views
Running multiple validation steps after each training epoch implementation help	1	676	December 16, 2023
Training inside Validation/Testing Loop Trainer	1	1046	February 22, 2021
Multiple train/validation dataloaders/multiple evaluation metrics implementation help	1	7570	November 2, 2020
Custom validation frequency implementation help	7	1015	November 11, 2022
Run validation loop and callback before training Trainer	3	705	December 18, 2023

Run multiple validation loops with different weights

Related topics