DDP training on RTX 4090 (ADA, cu118)

Seems to be AMD specific issue with multiple 4090s NCCL P2P functionality. Unsure who will resolve it when.

1 Like