Should we split batch_size according to ngpu_per_node when using DistributedDataParallel?

Good call, I will have to remember that! Thanks for your quick answer.
