Training crashes due to insufficient shared memory (shm) with nn.DataParallel

I have this issue on PyTorch 1.1.0 too. Is there an example of increasing the size of shared memory? Thanks!

I have this issue on PyTorch 1.1.0 too. Is there any way for users to set the size of shared memory? Thanks!

If you are using Ubuntu, you could check the max shared memory size via:

sysctl kernel.shmmax

and set a new value in /etc/sysctl.conf as:

kernel.shmmax=6400000
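For reference, the same limit can also be read directly from procfs without the sysctl binary. A minimal sketch, assuming a Linux system where /proc/sys/kernel/shmmax exists (values are plain text, in bytes):

```python
def read_shm_limit(name):
    # Entries under /proc/sys/kernel mirror the sysctl kernel.* keys.
    with open(f"/proc/sys/kernel/{name}") as f:
        return int(f.read().strip())

print("kernel.shmmax =", read_shm_limit("shmmax"))
```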

Thanks! When I execute the command:
sysctl kernel.shmmax
The result is:
18446744073692774399
Does that mean the value of shmmax in my system is big enough?

It might be big enough.
Which errors are you seeing that make you assume your shared memory is not large enough?

The error message is the same as ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm)
When I set num_workers=0, the code can run normally.


I get the same situation with next(iter(data_loader)) (my /dev/shm is 256G). Setting num_workers=0 does indeed fix this, but num_workers=0 takes more time to load the data. There is an issue for this situation (https://github.com/pytorch/pytorch/issues/13246), but can we have a better solution?


For me the issue was that I was already converting numpy arrays to torch tensors in the dataloader's __getitem__.

Numpy arrays should only be converted to torch tensors in the trainer loop, just before being sent to the model. Otherwise the tensors will make the shared memory grow out of bounds.
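The pattern above can be sketched without the DataLoader machinery. The Dataset-style class below (hypothetical name, numpy only) follows the same __len__/__getitem__ protocol as torch.utils.data.Dataset: __getitem__ returns plain numpy arrays, and the tensor conversion is deferred to the loop:

```python
import numpy as np

class ArrayDataset:
    """Minimal Dataset-style sketch: __getitem__ returns plain numpy
    arrays; tensor conversion happens later, in the training loop."""
    def __init__(self, data):
        self.data = data

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx):
        return self.data[idx]  # a numpy array, NOT a torch tensor

ds = ArrayDataset(np.arange(12, dtype=np.float32).reshape(4, 3))
for i in range(len(ds)):
    sample = ds[i]
    # torch.from_numpy(sample) would go here, just before the model call,
    # so DataLoader worker processes never hold tensors in shared memory.
    assert isinstance(sample, np.ndarray)
```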

You can monitor the shared memory by running the command watch -n .3 df -h
The shared memory corresponds to the line /dev/shm
The used amount should not increase after each epoch.
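If watch/df are not handy, the same /dev/shm numbers can be read with the Python standard library. A small sketch, assuming a Linux tmpfs mounted at /dev/shm:

```python
import os

def shm_usage(path="/dev/shm"):
    """Return (total, used, free) in bytes for the filesystem at `path`."""
    st = os.statvfs(path)
    total = st.f_blocks * st.f_frsize
    free = st.f_bavail * st.f_frsize
    return total, total - free, free

total, used, free = shm_usage()
print(f"/dev/shm: {used / 2**20:.1f} MiB used of {total / 2**20:.1f} MiB")
```

Calling this at the end of each epoch is a quick way to confirm whether the used amount keeps growing.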


I was always under the impression that arrays should be converted to tensors in __getitem__. It’s shown in the tutorial: https://pytorch.org/tutorials/beginner/data_loading_tutorial.html

That would make some sense, since some kinds of data cannot be gathered into an array until the collate_fn, e.g. text data. But why would they make the memory grow out of bounds? I thought that CPU tensors are just wrappers around ndarrays.
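As an illustration of that point, here is a minimal sketch of a pad-and-stack collate function for variable-length sequences (pad_collate is a hypothetical name; numpy only, no torch — a real DataLoader would receive it via its collate_fn argument):

```python
import numpy as np

def pad_collate(batch):
    """Pad each integer sequence with 0 to the batch maximum length and
    stack the result into a single (batch, max_len) array."""
    max_len = max(len(seq) for seq in batch)
    out = np.zeros((len(batch), max_len), dtype=np.int64)
    for i, seq in enumerate(batch):
        out[i, :len(seq)] = seq
    return out

print(pad_collate([[1, 2, 3], [4, 5], [6]]))
```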


@ptrblck I am still facing this error that the shared memory is not large enough. I face this issue when I use large models. For example, if I use four resnet50 models as sub-models in a single large model, then I face this issue. However, if I change the four resnet50 models to four resnet18 models in that same large model, then I don’t face this shared memory issue. Is there any way I can increase the shared memory in PyTorch, or do I need to modify the UNIX system? Thanks in advance.

You would need to modify it in your system.
Are you using a manual multiprocessing (or multi-node) approach? I’m not sure why your model needs shared memory. Usually the DataLoader uses the shared memory to transfer the loaded data and I’m unsure why you are seeing this issue when switching models.


I was using the pytorch-lightning framework for DistributedDataParallel (ddp) training.

@ptrblck Here is the detailed error log:
However, if I reduce num_workers from 2 to 0, then it works properly, but it increases the training time significantly.

RuntimeError: DataLoader worker (pid 24459) is killed by signal: Bus error. It is possible that dataloader's workers are out of shared memory. Please try to raise your shared memory limit.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/.local/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 20, in _wrap
    fn(i, *args)
  File "/home/.local/lib/python3.7/site-packages/pytorch_lightning/trainer/distrib_data_parallel.py", line 389, in ddp_train
    self.run_pretrain_routine(model)
  File "/home/.local/lib/python3.7/site-packages/pytorch_lightning/trainer/trainer.py", line 1015, in run_pretrain_routine
    self.train()
  File "/home/.local/lib/python3.7/site-packages/pytorch_lightning/trainer/training_loop.py", line 347, in train
    self.run_training_epoch()
  File "/home/.local/lib/python3.7/site-packages/pytorch_lightning/trainer/training_loop.py", line 451, in run_training_epoch
    self.run_evaluation(test_mode=self.testing)
  File "/home/.local/lib/python3.7/site-packages/pytorch_lightning/trainer/evaluation_loop.py", line 377, in run_evaluation
    eval_results = self._evaluate(self.model, dataloaders, max_batches, test_mode)
  File "/home/.local/lib/python3.7/site-packages/pytorch_lightning/trainer/evaluation_loop.py", line 256, in _evaluate
    for batch_idx, batch in enumerate(dataloader):
  File "/home/.local/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 345, in __next__
    data = self._next_data()
  File "/home/.local/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 841, in _next_data
    idx, data = self._get_data()
  File "/home/.local/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 808, in _get_data
    success, data = self._try_get_data()
  File "/home/.local/lib/python3.7/site-packages/torch/utils/data/dataloader.py", line 774, in _try_get_data
    raise RuntimeError('DataLoader worker (pid(s) {}) exited unexpectedly'.format(pids_str))
RuntimeError: DataLoader worker (pid(s) 24459) exited unexpectedly

This error message indeed points to the shared memory limit for multiprocessing, so you would need to increase it.


Hi, I got the same error,

ERROR: Unexpected segmentation fault encountered in worker.
Traceback (most recent call last):
  File "/home/miniconda/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 480, in _try_get_batch
    data = self.data_queue.get(timeout=timeout)
  File "/home/miniconda/lib/python3.6/multiprocessing/queues.py", line 104, in get
    if not self._poll(timeout):
  File "/home/miniconda/lib/python3.6/multiprocessing/connection.py", line 257, in poll
    return self._poll(timeout)
  File "/home/miniconda/lib/python3.6/multiprocessing/connection.py", line 414, in _poll
    r = wait([self], timeout)
  File "/home/miniconda/lib/python3.6/multiprocessing/connection.py", line 911, in wait
    ready = selector.select(timeout)
  File "/home/miniconda/lib/python3.6/selectors.py", line 376, in select
    fd_event_list = self._poll.poll(timeout)
  File "/home/miniconda/lib/python3.6/site-packages/torch/utils/data/_utils/signal_handling.py", line 65, in handler
    _error_if_any_worker_fails()
RuntimeError: DataLoader worker (pid 95106) is killed by signal: Segmentation fault.

What is interesting is that when I just parse the data line by line, I do not have this issue:

        with open(current_file, mode='rb') as f:
            text = f.read().decode('utf-8')
            all_data.extend(text.split('\n'))

but if I add JSON parsing logic after reading line by line, it reports this error:

        with open(current_file, mode='rb') as f:
            text = f.read().decode('utf-8')
            all_data.extend(text.split('\n'))

        json_data = []
        for line in all_data:
            try:
                json_data.append(json.loads(line))
            except:
                break

        return json_data
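Not an answer to the crash itself, but a hedged variant of the snippet above that surfaces malformed lines instead of silently stopping at the first failure (parse_json_lines is a hypothetical helper), which makes worker problems easier to localize:

```python
import json

def parse_json_lines(lines):
    """Parse each line as JSON; report and count malformed lines
    rather than breaking out of the loop on the first bad one."""
    parsed, bad = [], 0
    for n, line in enumerate(lines, 1):
        if not line.strip():
            continue  # skip blank lines, e.g. a trailing newline
        try:
            parsed.append(json.loads(line))
        except json.JSONDecodeError as exc:
            bad += 1
            print(f"line {n}: skipped ({exc})")
    return parsed, bad

data, bad = parse_json_lines(['{"a": 1}', 'not json', '{"b": 2}'])
```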

Any clue why this happens when JSON parsing is added? I think it is a pretty standard process, not specific to PyTorch.

I searched around and everything points to shared memory; however, I am not fully convinced my shm has a problem. I tested my shm in Docker and it is:

kernel.shmmax = 18446744073692774399

And also in my local test, even with a very small data sample (100 lines of JSON and num_workers=2), it still has the problem, so I wonder what the issue could be.

I met the same issue in PyTorch 1.5.1, and setting a new value in /etc/sysctl.conf did not work (the default value of kernel.shmmax is already big enough: 18446744073692774399).

This time, I used the df -h command and found there is a filesystem mounted at /dev/shm (shm is the shared memory, whose size defaults to 50% of the machine’s memory). Then I remounted it with a larger size:

mount -o remount -o size=yourMemorySize -o nr_inodes=1000000 -o noatime,nodiratime /dev/shm

This problem is fixed.

By the way, the OS I used is CentOS 7.

Hi @ptrblck ,

is it possible to have a more precise description of what is causing the error, to ease debugging? Is it caused by:

  1. my code
  2. the RAM
  3. the GPU memory
  4. filesystem memory/storage

I find that if it’s a recurring issue it’s often useful to state the source of the problem to more effectively fix it.


This solved my problem! Thank you

Hi. I’m suffering the same issue…

Platform info:

  • ubuntu 18.04
  • pytorch 1.8.1
  • cuda version 11.0
  • nvidia driver version 450.102.04
  • No Docker; I use a conda environment and am (luckily) the only user of this server.

System info:

  • Physical memory: 256GB
  • Shared memory: 1/2 of physical memory ~= 126GB
  • GPU: TITAN V (VRAM: 12GB) × 8

Training info:

  • Model: Quantization-aware training applied(library: brevitas) ResNet18
  • Dataset: I tried CIFAR10, CIFAR100, and ImageNet, but no error occurred with CIFAR10/100; maybe it is relevant to large datasets.
  • Distribution: Pytorch DDP

Symptoms:
Suddenly one GPU is released from multiprocessing (its memory usage goes to 0 and its dedicated process disappears) while the other GPUs lock on a semaphore at 100% GPU utilization.

I tried num_workers=8, 32, and 64, but got the error with 32 and 64.
I know that more workers consume more shared memory, but in my case it only consumes <1% of shared memory. (I monitored it with df -h.)

num_workers=64 is 3.5x faster than 8, so I don’t want to give up this option.
Are there any known bugs for num_workers?


Hi @FruitVinegar, have you found a solution for this problem? I have the same setup as you and the same problem.