PyTorch distributed example code hang. Deadlock?

Hey @Hao_Yuan

A few things could cause this issue.

  1. 127.0.0.0 is not a valid IP in some envs. Could you please check that. If it is indeed invalid, can you try 127.0.0.1 or localhost or other valid IP addresses?
  2. Some other process is occupying that port. Can you try a different port number?