TorchElastic: Connection reset by peer

I ran into exactly the same error: connection reset by peer. May I ask how did you finally solve it?