DDP Learning-Rate

mrshenli (Shen Li) July 8, 2020, 2:28pm 3

More discussion can be found in the thread "Should we split batch_size according to ngpu_per_node when DistributedDataparallel".
