DataParallel and DistributedDataParallel stuck at 100% GPU usage

@FarisHijazi Does the same stuckness issue occur when trying training with the Gloo backend?