I am trying to train my model on multiple GPUs, but I am having trouble with torch.distributions.Laplace, which I call in the forward pass.
I have uploaded a minimal working example that runs fine without torch.nn.DataParallel but fails when the model is wrapped in it.
Is there any way to make this code run on multiple GPUs?
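For illustration, here is a rough sketch of the pattern I mean (placeholder names, not the actual uploaded example): a module that builds a torch.distributions.Laplace inside forward() and is then wrapped in torch.nn.DataParallel.

```python
# Simplified sketch with placeholder names, not the uploaded example:
# a module that creates torch.distributions.Laplace in forward() and is
# wrapped in torch.nn.DataParallel for multi-GPU training.
import torch
import torch.nn as nn

class LaplaceNet(nn.Module):
    def __init__(self, dim=16):
        super().__init__()
        self.loc = nn.Linear(dim, dim)
        self.log_scale = nn.Linear(dim, dim)

    def forward(self, x):
        loc = self.loc(x)
        scale = torch.exp(self.log_scale(x))
        dist = torch.distributions.Laplace(loc, scale)  # distribution built in the forward pass
        return dist.rsample()  # reparameterized sample, keeps gradients

model = nn.DataParallel(LaplaceNet()).cuda()
out = model(torch.randn(8, 16).cuda())  # input gets scattered across the available GPUs
```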
Is there any specific reason for using DataParallel instead of DistributedDataParallel? I only have experience with single-GPU machines, so I don't know the details here.
No particular reason; I have just seen more examples using DataParallel.
But it could be worth trying out to see whether things look different with DistributedDataParallel.
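If I do try it, I imagine the swap would look roughly like the sketch below (assumptions on my part: launched with torchrun, one process per GPU, and the module is the same placeholder stand-in as above, not my real model).

```python
# Rough sketch of trying DistributedDataParallel instead of DataParallel.
# Assumptions: launched with `torchrun --nproc_per_node=<num_gpus> script.py`,
# one process per GPU; LaplaceNet is only a placeholder module.
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

class LaplaceNet(nn.Module):
    def __init__(self, dim=16):
        super().__init__()
        self.loc = nn.Linear(dim, dim)
        self.log_scale = nn.Linear(dim, dim)

    def forward(self, x):
        laplace = torch.distributions.Laplace(self.loc(x), torch.exp(self.log_scale(x)))
        return laplace.rsample()

def main():
    dist.init_process_group("nccl")            # torchrun sets the rank/world-size env vars
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = DDP(LaplaceNet().to(local_rank), device_ids=[local_rank])
    x = torch.randn(8, 16, device=local_rank)  # each process would load its own shard of data
    out = model(x)                             # forward pass runs independently in each process
    print(out.shape)

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```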