HorovodStrategy
- class pytorch_lightning.strategies.HorovodStrategy(accelerator=None, parallel_devices=None, checkpoint_io=None, precision_plugin=None)[source]
Bases:
pytorch_lightning.strategies.parallel.ParallelStrategy
Plugin for Horovod distributed training integration.
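A minimal usage sketch (not from the original docs), assuming Horovod is installed and the script is launched with `horovodrun`; the script name, device counts, and commented-out `fit` call are illustrative:

```python
import pytorch_lightning as pl
from pytorch_lightning.strategies import HorovodStrategy

# Launched with e.g. `horovodrun -np 4 python train.py`; every Horovod worker
# runs this script and the strategy wires up the distributed state.
trainer = pl.Trainer(
    accelerator="gpu",
    devices=1,                   # one device per Horovod process
    strategy=HorovodStrategy(),  # or simply strategy="horovod"
)
# trainer.fit(model, datamodule=datamodule)  # model / datamodule are your own objects
```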
- all_gather(result, group=torch.distributed.group.WORLD, sync_grads=False)[source]
Perform an all_gather on all processes.
- Return type
  Tensor
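A hedged sketch of how all_gather might be called from inside a LightningModule hook; the tensor value and logged metric name are illustrative, and `self.trainer.strategy` is assumed to be the active HorovodStrategy:

```python
import torch
import pytorch_lightning as pl

class LitModel(pl.LightningModule):
    def validation_step(self, batch, batch_idx):
        # Hypothetical per-process score; the value is illustrative.
        local_score = torch.tensor([0.5], device=self.device)
        # Gathers the tensor from every Horovod process, concatenated along
        # the first dimension (one entry per process here).
        all_scores = self.trainer.strategy.all_gather(local_score)
        self.log("val_score_mean", all_scores.mean())
```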
- barrier(*args, **kwargs)[source]
Synchronizes all processes, blocking them until the whole group enters this function.
- broadcast(obj, src=0)[source]
Broadcasts an object to all processes.
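A hedged sketch of broadcasting a plain Python object from rank 0 to every worker; the hook choice and the contents of `run_config` are illustrative:

```python
import pytorch_lightning as pl

class LitModel(pl.LightningModule):
    def on_fit_start(self):
        # Rank 0 decides a value; every other rank starts with a placeholder.
        run_config = {"lr": 1e-3} if self.trainer.is_global_zero else None
        # After the broadcast, every Horovod process holds rank 0's dict.
        run_config = self.trainer.strategy.broadcast(run_config, src=0)
```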
- model_to_device()[source]
Moves the model to the correct device.
- reduce(tensor, group=None, reduce_op='mean')[source]
Reduces a tensor from several distributed processes to one aggregated tensor.
- Parameters
  - tensor – the tensor to sync and reduce
  - group – the process group to reduce within; Horovod reduces across all processes, so this is expected to stay None
  - reduce_op – the reduction operation, e.g. 'mean' (default) or 'sum'
- Returns
The reduced value, except when the input was not a tensor, in which case the output remains unchanged.
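A hedged sketch of averaging a per-process loss with reduce; the loss value and logged metric name are illustrative:

```python
import torch
import pytorch_lightning as pl

class LitModel(pl.LightningModule):
    def validation_step(self, batch, batch_idx):
        # Hypothetical per-process loss; the value is illustrative.
        local_loss = torch.tensor(0.42, device=self.device)
        # Averages the tensor across all Horovod processes; reduce_op="sum"
        # would aggregate by summation instead.
        mean_loss = self.trainer.strategy.reduce(local_loss, reduce_op="mean")
        self.log("val_loss", mean_loss)
```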
- setup(trainer)[source]
Sets up plugins for the trainer fit and creates the optimizers.
- teardown()[source]
This method is called to tear down the training process.
It is the right place to release memory and free other resources.
- Return type
  None
- property handles_gradient_accumulation: bool
Whether the plugin handles gradient accumulation internally.
- Return type
  bool