nn.DataParallel: TypeError: expected sequence object with len >= 0 or a single integer

From the stack trace it looks like the problem is with the outputs no?
Maybe your forward returns Tensors that are not on the right device?