Accuracy with multiple GPUs is lower than with a single GPU

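Hey @sangyx, with DDP each process loads its own batch, so the effective batch size grows with the number of GPUs. You will likely need to retune the batch size and learning rate to match the single-GPU result. See the discussion below: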
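
As a rough starting point for that tuning, here is a minimal sketch of the common linear scaling heuristic: the learning rate is scaled in proportion to the effective batch size (per-GPU batch size times the number of processes). The helper name `scale_for_ddp` and the numbers are hypothetical, chosen only for illustration, and the heuristic is not guaranteed to recover single-GPU accuracy; treat the result as an initial value to tune from.

```python
def scale_for_ddp(base_lr, base_batch, per_gpu_batch, world_size):
    """Hypothetical helper: linear scaling heuristic for the learning rate.

    base_lr / base_batch: values that worked in the single-GPU run.
    per_gpu_batch: batch size each DDP process loads.
    world_size: number of DDP processes (usually one per GPU).
    """
    # Each process contributes its own batch to every optimizer step.
    effective_batch = per_gpu_batch * world_size
    # Scale the learning rate by the same factor as the batch size.
    scaled_lr = base_lr * effective_batch / base_batch
    return effective_batch, scaled_lr


# Assumed example: single-GPU run tuned at batch size 32 and lr 0.01,
# moved to 4 GPUs while keeping 32 samples per GPU.
effective_batch, scaled_lr = scale_for_ddp(
    base_lr=0.01, base_batch=32, per_gpu_batch=32, world_size=4
)
print(f"effective batch size: {effective_batch}, suggested lr: {scaled_lr:.4f}")
```

The other option is to keep the effective batch size unchanged by dividing the per-GPU batch size by the number of GPUs, in which case the original learning rate is a reasonable place to start.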