PyTorch Forums
DataParallel and DistributedDataParallel stuck at 100% GPU usage
distributed
rvarm1
(Rohan Varma)
July 11, 2021, 7:55pm
6
@FarisHijazi
Does the same hang occur when you train with the Gloo backend instead?
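In case it helps, here is a minimal sketch of one way to switch backends for a quick test. It assumes the script is started with a launcher that sets the usual `env://` variables (`MASTER_ADDR`, `MASTER_PORT`, `RANK`, `WORLD_SIZE`). The `DDP_BACKEND` environment variable and both helper names are made up for illustration, not part of PyTorch.

```python
import os


def pick_backend(default="nccl"):
    # DDP_BACKEND is a hypothetical variable used only in this sketch,
    # so the backend can be changed without editing the training script.
    return os.environ.get("DDP_BACKEND", default).lower()


def init_distributed():
    # Imported here so the helper above can be used without torch installed.
    import torch.distributed as dist

    # Assumes MASTER_ADDR, MASTER_PORT, RANK and WORLD_SIZE are already set.
    dist.init_process_group(backend=pick_backend(), init_method="env://")
```

Running with `DDP_BACKEND=gloo` would then use Gloo for the DistributedDataParallel run. This only applies to DistributedDataParallel, since DataParallel does not use a process group.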