Resuming from model checkpoints produces different training loss

How do you restore a checkpoint so that the dataloader also resumes from the right batch?

This seems important for reproducible research: if the dataloader restarts from the beginning of the epoch (or shuffles differently after resuming), the training loss won't match an uninterrupted run.
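One common approach is to make the per-epoch shuffle deterministic (keyed on a base seed plus the epoch number, as `DistributedSampler.set_epoch` does in PyTorch), checkpoint the current epoch and batch index, and on resume rebuild the same epoch's batch order and skip the batches already consumed. Here is a minimal pure-Python sketch of that idea; the helper names `make_loader` and `resume_loader` are hypothetical, not a real PyTorch API:

```python
import itertools
import random

def make_loader(dataset, batch_size, seed, epoch):
    # Deterministic per-epoch shuffle: the same (seed, epoch) pair
    # always yields the same batch order.
    order = list(range(len(dataset)))
    random.Random(seed + epoch).shuffle(order)
    for i in range(0, len(order), batch_size):
        yield [dataset[j] for j in order[i:i + batch_size]]

def resume_loader(dataset, batch_size, seed, epoch, start_batch):
    # Rebuild the epoch's batch order, then skip the batches
    # already consumed before the checkpoint.
    loader = make_loader(dataset, batch_size, seed, epoch)
    return itertools.islice(loader, start_batch, None)

data = list(range(10))

# Uninterrupted epoch 3:
full = list(make_loader(data, batch_size=2, seed=0, epoch=3))

# "Checkpoint" after 2 batches of epoch 3, then resume:
resumed = list(resume_loader(data, batch_size=2, seed=0, epoch=3,
                             start_batch=2))
assert resumed == full[2:]  # remaining batches are identical
```

Skipping batches this way still iterates (and may still load) the skipped samples, which can be slow for large `start_batch`; a resumable sampler that slices the index order before batching avoids that cost. For exact bitwise reproducibility you would also need to checkpoint the RNG states (e.g. `torch.get_rng_state`, plus `random` and NumPy states, and worker seeds).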


related: Resume iterating dataloader from checkpoint batch_idx