DDPShardedStrategy¶

class pytorch_lightning.strategies.DDPShardedStrategy(*args, **kwargs)[source]¶

Optimizer and gradient sharded training provided by FairScale.

block_backward_sync()[source]¶

Blocks syncing gradients behaviour on backwards pass.

This is useful for skipping sync when accumulating gradients, reducing communication overhead Returns: context manager with sync behaviour off

connect(model)[source]¶

Called by the accelerator to connect the accelerator and the model with this plugin.

pre_backward(closure_loss)[source]¶

Run before precision plugin executes backward.

setup(trainer)[source]¶

Setup plugins for the trainer fit and creates optimizers.