EarlyStopping warns that ‘val_loss’ is not found. I debugged a bit and saw that only callback_metrics (i.e. {'val_early_stop_on': None, 'val_checkpoint_on': None} are considered as early stopping metrics when EvalResult is used. I could set early_stop_on, but I would have to do that in validation_epoch_end to be able to calculate the mean, which would diminish the advantage of reduce_fx. I can also create an “improvment”-issue, if you want.