ncclInvalidUsage of torch.nn.parallel.DistributedDataParallel

I solved this problem by changing the device placement to:
net.to(f'cuda:{args.local_rank}')
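For context, here is a rough sketch of where that line might sit in a DDP script. It assumes one process per GPU and that torch.distributed.init_process_group(...) has already been called. The helper names parse_local_rank, device_for, and wrap_model are hypothetical, made up for illustration. A common cause of ncclInvalidUsage is several processes ending up on the same GPU, which is what pinning each process to its own local rank avoids, though your setup may differ.

```python
import argparse
import os


def parse_local_rank(argv=None):
    """Read the local rank from --local_rank, falling back to the LOCAL_RANK env var.

    torchrun sets LOCAL_RANK in the environment; the older
    torch.distributed.launch passes --local_rank on the command line.
    """
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--local_rank",
        type=int,
        default=int(os.environ.get("LOCAL_RANK", 0)),
    )
    args, _ = parser.parse_known_args(argv)
    return args.local_rank


def device_for(local_rank):
    """Build the per-process device string, e.g. 'cuda:1' for local rank 1."""
    return f"cuda:{local_rank}"


def wrap_model(net, local_rank):
    """Move `net` to this process's GPU, then wrap it in DistributedDataParallel.

    Hypothetical helper: assumes the process group is already initialized.
    """
    # Imported here so the helpers above can be used without torch installed.
    import torch
    from torch.nn.parallel import DistributedDataParallel as DDP

    torch.cuda.set_device(local_rank)   # make this GPU the default for the process
    net.to(device_for(local_rank))      # the fix: each rank uses its own GPU
    return DDP(net, device_ids=[local_rank], output_device=local_rank)
```

In a training script this would be used roughly as local_rank = parse_local_rank() followed by net = wrap_model(net, local_rank), with inputs also moved to the same device before the forward pass.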
