How can I use the system NCCL library instead of the one bundled with PyTorch when using DDP?

Hey @ayl

You can export USE_SYSTEM_NCCL=1 and then compile PyTorch from source. The build will then link against the NCCL installed on your system instead of the bundled copy.
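Here is a rough sketch of that in Python, in case you want to script it. The helper names (system_nccl_build_env, build_pytorch, report_nccl_version) are made up for illustration, and the `setup.py develop` build command is an assumption; use whatever build command the README of your PyTorch checkout documents.

```python
import os
import subprocess
import sys


def system_nccl_build_env(base_env=None):
    """Return a copy of the environment with USE_SYSTEM_NCCL=1 set."""
    env = dict(os.environ if base_env is None else base_env)
    env["USE_SYSTEM_NCCL"] = "1"
    return env


def build_pytorch(source_dir):
    """Build PyTorch from a source checkout against the system NCCL.

    Assumption: the checkout is built with `python setup.py develop`.
    """
    subprocess.run(
        [sys.executable, "setup.py", "develop"],
        cwd=source_dir,
        env=system_nccl_build_env(),
        check=True,
    )


def report_nccl_version():
    """Print and return the NCCL version reported by the installed torch."""
    try:
        import torch
        import torch.distributed as dist
    except ImportError:
        print("torch is not installed")
        return None
    # Only query the version if this build actually includes NCCL support.
    if not (dist.is_available() and dist.is_nccl_available()):
        print("this torch build has no NCCL support")
        return None
    version = torch.cuda.nccl.version()
    print("NCCL version:", version)
    return version
```

You would call build_pytorch("/path/to/pytorch") (placeholder path) and, once the build finishes, report_nccl_version() to see whether the reported version matches the NCCL you installed on the system.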

See this discussion: "Torch distributed not working on two machines [nccl backend]".