Did you find a solution? I am running into the same problem.
DDP Error: torch.distributed.elastic.agent.server.api:Received 1 death signal, shutting down workers
2 Likes
Did you find a solution? I am running into the same problem.