TorchElasticEnvironment
- class lightning.pytorch.plugins.environments.TorchElasticEnvironment
Bases: ClusterEnvironment
Environment for fault-tolerant and elastic training with torchelastic.
- static detect()
Returns True if the current process was launched using the torchelastic command.
- Return type: bool
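As a rough illustration of what such a detection check could look like, the sketch below tests for environment variables that `torchrun` exports to each worker process (`RANK`, `GROUP_RANK`, `LOCAL_RANK`, `LOCAL_WORLD_SIZE`). The helper name `looks_like_torchelastic` is hypothetical, and the exact check Lightning performs may differ between versions.

```python
import os

# Variables that torchrun exports to every worker process.
_REQUIRED_VARS = ("RANK", "GROUP_RANK", "LOCAL_RANK", "LOCAL_WORLD_SIZE")


def looks_like_torchelastic(env=None):
    """Hypothetical helper: True if all expected variables are present.

    `env` defaults to the real process environment; passing a dict
    makes the check easy to try out without launching torchrun.
    """
    env = os.environ if env is None else env
    return all(name in env for name in _REQUIRED_VARS)
```

Accepting the environment as a parameter is only a convenience for experimentation; it is not part of the class documented here.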
- global_rank()
The rank (index) of the currently running process across all nodes and devices.
- Return type: int
- local_rank()
The rank (index) of the currently running process inside the current node.
- Return type: int
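To make the difference between the two ranks concrete, here is a minimal sketch that reads them from the `RANK` and `LOCAL_RANK` variables that `torchrun` sets for each worker. The function `read_ranks` and the example values are hypothetical and only illustrate the idea; they are not the class's own code.

```python
import os


def read_ranks(env=None):
    """Hypothetical helper: return (global_rank, local_rank) as integers."""
    env = os.environ if env is None else env
    global_rank = int(env["RANK"])       # index across all nodes and devices
    local_rank = int(env["LOCAL_RANK"])  # index within the current node
    return global_rank, local_rank


# Assumed layout: 2 nodes with 4 processes each. The third process on the
# second node has local rank 2 and global rank 1 * 4 + 2 = 6.
example_env = {"RANK": "6", "LOCAL_RANK": "2"}
print(read_ranks(example_env))  # prints (6, 2)
```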
- validate_settings(num_devices, num_nodes)
Validates the settings configured in the script against the environment and raises an exception if they are inconsistent.
- Return type: None
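One plausible consistency check of this kind, sketched under the assumption that the product of `num_devices` and `num_nodes` should match the `WORLD_SIZE` variable exported by `torchrun`, is shown below. The function `check_world_size`, the choice of `ValueError`, and the error message are hypothetical; the class may validate different or additional settings.

```python
import os


def check_world_size(num_devices, num_nodes, env=None):
    """Hypothetical check: the script's settings must match WORLD_SIZE."""
    env = os.environ if env is None else env
    expected = num_devices * num_nodes  # total processes the script asks for
    actual = int(env["WORLD_SIZE"])     # total processes the launcher started
    if expected != actual:
        raise ValueError(
            f"Script requests {num_devices} devices x {num_nodes} nodes "
            f"= {expected} processes, but WORLD_SIZE is {actual}."
        )
```

Failing early with a descriptive error is generally preferable to letting a mismatched job hang while processes wait for peers that were never started.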