Running DDP across two machines

It works for me when I change the LOCAL_RANK of the 2nd machine. Also, I ran export CUDA_VISIBLE_DEVICES=0 separately on each machine.
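
In case it helps, a quick way to confirm that both variables are really set in the shell you launch from is to print them on each machine before starting training. This is only a sketch: read_launch_env is a hypothetical helper name, it only reads the environment and does not start DDP, and which LOCAL_RANK value is right for the 2nd machine depends on your launcher.

```python
import os


def read_launch_env(env=os.environ):
    """Collect LOCAL_RANK and CUDA_VISIBLE_DEVICES from an environment mapping.

    Hypothetical helper for illustration; it does not touch PyTorch at all.
    """
    local_rank = env.get("LOCAL_RANK")
    visible = env.get("CUDA_VISIBLE_DEVICES")
    return {
        # None means the variable was not exported in this shell.
        "LOCAL_RANK": int(local_rank) if local_rank is not None else None,
        # CUDA_VISIBLE_DEVICES is a comma-separated list of device ids.
        "CUDA_VISIBLE_DEVICES": visible.split(",") if visible else [],
    }


if __name__ == "__main__":
    # Run this on each machine, in the same shell you launch training from.
    print(read_launch_env())
```

If LOCAL_RANK comes back as None, or CUDA_VISIBLE_DEVICES as an empty list, the export did not reach that shell.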