looks like closures aren’t supported with 16bit precision training.
https://pytorch.org/docs/stable/amp.html#torch.cuda.amp.GradScaler.step
looks like closures aren’t supported with 16bit precision training.
https://pytorch.org/docs/stable/amp.html#torch.cuda.amp.GradScaler.step