Hey @ayl
You can export USE_SYSTEM_NCCL=1
, and then compile PyTorch from source.
See this discussion Torch distributed not working on two machines [nccl backend]
Hey @ayl
You can export USE_SYSTEM_NCCL=1
, and then compile PyTorch from source.
See this discussion Torch distributed not working on two machines [nccl backend]