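Hey @sangyx, when using DDP, each process typically loads its own batch, so the effective batch size is the per-GPU batch size multiplied by the number of processes. You might need to tune the batch size and learning rate to account for that. See the discussion below: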