DataParallel imbalanced memory usage

I am having a similar issue. Could this be because the loss is calculated outside the model's forward function, so it runs on a single GPU after the outputs are gathered rather than on each replica?
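
To make the question concrete, here is a minimal sketch of what computing the loss inside forward could look like. It is not taken from the original post: `ModelWithLoss` is a hypothetical wrapper, and the `nn.Linear` model, tensor shapes, and `CrossEntropyLoss` are placeholders. The idea is that each replica computes its own loss, so only a small loss tensor per GPU is gathered instead of the full outputs.

```python
import torch
import torch.nn as nn


class ModelWithLoss(nn.Module):
    """Hypothetical wrapper: runs the model and computes the loss in forward()."""

    def __init__(self, model, criterion):
        super().__init__()
        self.model = model
        self.criterion = criterion

    def forward(self, inputs, targets):
        outputs = self.model(inputs)
        loss = self.criterion(outputs, targets)
        # Return shape (1,) so DataParallel can concatenate the
        # per-replica losses along dim 0 when it gathers them.
        return loss.unsqueeze(0)


# Placeholder model and data, chosen only for illustration.
model = nn.Linear(16, 4)
wrapped = nn.DataParallel(ModelWithLoss(model, nn.CrossEntropyLoss()))

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
wrapped = wrapped.to(device)

inputs = torch.randn(8, 16, device=device)
targets = torch.randint(0, 4, (8,), device=device)

# One loss value per replica comes back; average them into a scalar.
loss = wrapped(inputs, targets).mean()
loss.backward()
```

Averaging the per-replica losses matches the loss over the whole batch only when every GPU receives the same number of samples, so treat this as an approximation if the batch size is not divisible by the number of GPUs. Whether this fully evens out memory usage depends on the model; the first GPU still holds the gathered losses and the master copy of the parameters.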
