Spawned Processes with DDP

See the discussion "`torch.distributed.barrier` used in multi-node distributed data-parallel training".
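For readers who want something concrete, here is a minimal sketch of how spawned processes might synchronize on `torch.distributed.barrier`. It is not taken from the linked discussion. The `gloo` backend, the shared-file rendezvous, and the names `worker`, `main`, and `init_file` are assumptions chosen so the example runs on a single CPU-only machine; a real multi-node job would typically use a different backend and rendezvous method.

```python
import os
import tempfile

import torch
import torch.distributed as dist
import torch.multiprocessing as mp


def worker(rank, world_size, init_file):
    """Entry point for each process; mp.spawn passes the rank as the first argument."""
    dist.init_process_group(
        backend="gloo",                     # assumption: CPU-friendly backend
        init_method=f"file://{init_file}",  # assumption: shared-file rendezvous
        rank=rank,
        world_size=world_size,
    )
    try:
        # Each rank blocks here until every rank in the group has arrived.
        dist.barrier()
        # Sum the ranks across all processes to confirm they can communicate.
        total = torch.tensor([float(rank)])
        dist.all_reduce(total, op=dist.ReduceOp.SUM)
        return total.item()
    finally:
        dist.destroy_process_group()


def main(world_size=2):
    # The rendezvous file lives in a fresh temporary directory (hypothetical path).
    with tempfile.TemporaryDirectory() as tmp:
        init_file = os.path.join(tmp, "rendezvous")
        # Start world_size processes, each running worker(rank, world_size, init_file).
        mp.spawn(worker, args=(world_size, init_file), nprocs=world_size, join=True)


if __name__ == "__main__":
    main()
```

Note that `mp.spawn` discards the return value of `worker`; the value is returned only so the function can also be called directly in a single process when checking it by hand.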

For more detailed explanations, cc @mrshenli