Training crashes due to - Insufficient shared memory (shm) - nn.DataParallel

The error message is the same as ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm)
When I set num_works=0, the code can run normall.

1 Like