The linked post is ~3.5 years old. The current PyTorch binaries use the nvidia-nccl-cu12 package from PyPI, i.e. NCCL is loaded as a shared library, so LD_PRELOAD should work (as should manually replacing the library installed by the pip wheel).
You can verify which NCCL library is actually loaded by running your script with e.g. LD_DEBUG=libs set.
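As an alternative to reading the LD_DEBUG output, here is a minimal sketch (Linux only) that lists the libnccl files mapped into a process by parsing text in the /proc/<pid>/maps format. The helper name `loaded_nccl_libraries` is made up for this example and is not part of PyTorch or NCCL.

```python
from pathlib import Path


def loaded_nccl_libraries(maps_text: str) -> list[str]:
    """Return the unique libnccl paths found in /proc/<pid>/maps-style text.

    Hypothetical helper for illustration; not a PyTorch or NCCL API.
    """
    paths: list[str] = []
    for line in maps_text.splitlines():
        # Fields: address, perms, offset, dev, inode, [pathname]
        fields = line.split(maxsplit=5)
        if len(fields) < 6:
            continue  # anonymous mapping, no backing file
        path = fields[5].strip()
        # Match on the file name only, so directories named "nccl" are ignored.
        if "libnccl" in Path(path).name and path not in paths:
            paths.append(path)
    return paths
```

To use it, import torch first so its shared libraries are mapped, then call `loaded_nccl_libraries(Path("/proc/self/maps").read_text())`. With LD_PRELOAD set, the printed path should be your own build rather than the one under site-packages/nvidia/nccl. Depending on the build, NCCL may only show up after the CUDA libraries have been loaded.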