DistributedDataParallel across two machines with different versions of PyTorch and CUDA

Hello, I am trying to set up distributed training across two laptops using DistributedDataParallel. One computer has CUDA 10 and the other one has CUDA 11. Will this cause problems during training?
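
For reference, a minimal sketch of the kind of two-machine setup described here (this is an illustration, not the actual code; the master address, port, and rank handling are assumptions, with each laptop launching the script with its own `--rank`):

```python
import os
import argparse

import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP


def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--rank", type=int, required=True)        # 0 on the master laptop, 1 on the other
    parser.add_argument("--world-size", type=int, default=2)      # two machines, one GPU each
    parser.add_argument("--master-addr", default="192.168.1.10")  # hypothetical IP of machine 0
    parser.add_argument("--master-port", default="29500")
    args = parser.parse_args()

    os.environ["MASTER_ADDR"] = args.master_addr
    os.environ["MASTER_PORT"] = args.master_port

    # NCCL is the usual backend for multi-GPU training over the network;
    # it is also where a CUDA/NCCL mismatch between the machines would surface.
    dist.init_process_group("nccl", rank=args.rank, world_size=args.world_size)

    model = nn.Linear(10, 10).cuda()
    ddp_model = DDP(model, device_ids=[0])

    # ... training loop using ddp_model ...

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```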

I haven’t tried mixing different CUDA and NCCL versions in DDP runs, but I would guess it could yield unexpected behavior or other issues, especially if you are also mixing different PyTorch releases.
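
To see exactly what each machine would bring to the run, you could print the versions bundled with the installed PyTorch build on both laptops and compare them, e.g.:

```python
import torch

print("PyTorch:", torch.__version__)
print("CUDA (build):", torch.version.cuda)          # CUDA version PyTorch was built against
print("cuDNN:", torch.backends.cudnn.version())
print("NCCL:", torch.cuda.nccl.version())            # NCCL version shipped with this build
```

If the PyTorch releases (and thus the bundled NCCL versions) match on both machines, the difference in the locally installed CUDA toolkits matters less, since the pip/conda binaries ship their own CUDA runtime.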