Why DataParallel works poorly for CNNs compared to LSTMs?

Have a look at this explanation of the general parallel operations.