How can I get returns from a function in distributed data parallel?

Hi, you can use

  1. torch.multiprocessing.SimpleQueue to let the child processes to put the results in the queue.

  2. point-to-point communication functions to send tensors between different distributed processes.

You may want to refer to this thread for more explanation.