The closure passed to the optimizer is None when using fp16

It looks like closures aren't supported with 16-bit precision training; per the docs, `GradScaler.step` does not currently support closure use:
https://pytorch.org/docs/stable/amp.html#torch.cuda.amp.GradScaler.step
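For reference, here is a minimal sketch of the standard AMP recipe from the linked docs (the toy model, data, and optimizer are just for illustration). The forward/backward pass runs *before* `scaler.step(optimizer)`, which is why there is no closure left to pass through to the optimizer:

```python
import torch
from torch import nn
from torch.cuda.amp import GradScaler, autocast

# Hypothetical toy setup, only for illustration.
model = nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
use_amp = torch.cuda.is_available()  # AMP is a no-op on CPU
scaler = GradScaler(enabled=use_amp)

x = torch.randn(4, 10)
y = torch.randn(4, 1)

optimizer.zero_grad()
# Forward pass under autocast, backward on the scaled loss --
# both happen before the optimizer step, not inside a closure.
with autocast(enabled=use_amp):
    loss = nn.functional.mse_loss(model(x), y)
scaler.scale(loss).backward()
scaler.step(optimizer)  # scaler.step(optimizer, closure) is not supported
scaler.update()
```

So optimizers that require a closure (e.g. LBFGS, which may re-evaluate the loss multiple times per step) don't fit this recipe, and the closure arrives as `None` under fp16.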