DataParallel caching replicate()

Is there a way to perform distributed data parallelism within a single node, across multiple GPUs? DataParallel re-copies the model on every forward pass, and that seems to slow things down significantly for me.
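
For context, here is a rough sketch of what DataParallel effectively does on every forward call, written with the public helpers in torch.nn.parallel (this is a paraphrase of the documented behaviour, not the library source; the helper name data_parallel_forward and the device IDs are just for illustration). The replicate() step is the per-iteration model copy in question:

```python
import torch
import torch.nn as nn
from torch.nn.parallel import scatter, replicate, parallel_apply, gather

def data_parallel_forward(module, batch, device_ids, output_device=0):
    # Split the input batch across the configured GPUs.
    scattered = scatter(batch, device_ids)
    # Copy the module's parameters and buffers to every device.
    # This happens again on the next call, which is the overhead in question.
    replicas = replicate(module, device_ids[:len(scattered)])
    # Run each replica on its shard of the batch in parallel.
    outputs = parallel_apply(replicas, [(x,) for x in scattered])
    # Move the per-GPU outputs back to one device and concatenate.
    return gather(outputs, output_device)

if __name__ == "__main__":
    if torch.cuda.device_count() >= 2:
        model = nn.Linear(16, 4).cuda(0)
        x = torch.randn(8, 16).cuda(0)
        out = data_parallel_forward(model, x, device_ids=[0, 1])
        print(out.shape)  # torch.Size([8, 4])
```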

Doesn’t distributed data parallelism also synchronize?

Yes, right. What I'm after is a synchronized version of DataParallel that doesn't re-replicate the model on every forward pass. How can that be done?
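
For reference, DistributedDataParallel also works within a single node, with one process per GPU: each process builds the model once, and only gradients are all-reduced during backward(), so there is no per-iteration replicate(). Below is a minimal single-node sketch, assuming the NCCL backend, a toy model, and random tensors standing in for a real DataLoader with a DistributedSampler; adapt it to your own training loop.

```python
import os
import torch
import torch.distributed as dist
import torch.multiprocessing as mp
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

def run(rank, world_size):
    # Single-node rendezvous: all processes connect to localhost.
    os.environ["MASTER_ADDR"] = "127.0.0.1"
    os.environ["MASTER_PORT"] = "29500"
    dist.init_process_group("nccl", rank=rank, world_size=world_size)
    torch.cuda.set_device(rank)

    # The model is constructed once per process, not once per iteration.
    model = nn.Linear(16, 4).cuda(rank)
    ddp_model = DDP(model, device_ids=[rank])
    optimizer = torch.optim.SGD(ddp_model.parameters(), lr=0.01)
    loss_fn = nn.MSELoss()

    for step in range(10):
        # Each process would normally load its own shard of the batch
        # (e.g. via a DataLoader with a DistributedSampler).
        x = torch.randn(8, 16).cuda(rank)
        y = torch.randn(8, 4).cuda(rank)

        optimizer.zero_grad()
        loss = loss_fn(ddp_model(x), y)
        loss.backward()   # gradients are all-reduced across GPUs here
        optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    world_size = torch.cuda.device_count()
    mp.spawn(run, args=(world_size,), nprocs=world_size, join=True)
```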