How to fix a SIGSEGV in PyTorch when using distributed training (e.g. DDP)?

I also played around with the versions of everything. Python 3.8, PyTorch 1.7.1, and CUDA 10.2 (but with a driver of at least 11.0) seemed to work consistently for me.
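If it helps, here is a rough sketch for checking which versions a given environment actually has before comparing against the combination above. The helper name `environment_report` and the dictionary keys are just my own choices for illustration, and it assumes nothing beyond `torch.__version__`, `torch.version.cuda`, and `torch.cuda.is_available()`. It does not report the driver version; `nvidia-smi` shows that.

```python
import platform


def environment_report():
    """Collect interpreter and PyTorch build versions (hypothetical helper).

    Values stay None when PyTorch is not installed in this environment.
    """
    report = {
        "python": platform.python_version(),
        "torch": None,
        "cuda_build": None,
        "cuda_available": None,
    }
    try:
        import torch
    except ImportError:
        # PyTorch is missing; return only the Python version.
        return report

    report["torch"] = torch.__version__
    # CUDA toolkit version PyTorch was built against (None on CPU-only builds).
    report["cuda_build"] = torch.version.cuda
    # Whether PyTorch can see a usable GPU with the installed driver.
    report["cuda_available"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for key, value in environment_report().items():
        print(f"{key}: {value}")
```

Running this on every node (or in every conda env you try) makes it easier to spot a mismatch between machines, which is worth ruling out before digging into the segfault itself.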

Glad your Python version change worked.