Does multi-gpu faster than single-gpu?

Alex_Hex · August 30, 2017, 9:02am

I trained a simple network on MNIST dataset with multi-gpu and single-gpu. But the multi-gpu version took 61 seconds, the single-gpu version took 18 seconds? Is it to do with data switching between diffetent gpu?

Martin_Mundt · August 31, 2017, 3:41pm

I would suggest trying something larger in terms of both model and data.

MNIST data is really tiny and your model is likely very small, so you are probably running into more overhead than advantage when using multiple GPUs.

I played around a little but I didn’t actually see much of an advantage unless I moved to large-scale datasets like ImageNet, Pascal etc. where models are larger as well.

Alex_Hex · September 1, 2017, 12:57am

Thanks for your reply, and I will try it.