Are you shuffling the DataLoader?
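If the length of your dataset is not evenly divisible by the batch size, you might see small differences in your validation accuracy. The last batch will be smaller than the others, which biases the result when you normalize by dividing by len(loader): every batch then gets the same weight regardless of how many samples it holds. With shuffling, different samples end up in that last batch on each run, so the value changes slightly.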
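Here is a minimal sketch of the effect, assuming the accuracy is computed per batch and then averaged over the number of batches. Plain Python lists stand in for the batches a DataLoader would yield, and the helper names (make_batches, batch_averaged_accuracy, sample_averaged_accuracy) are made up for illustration, not taken from your code.

```python
def make_batches(preds, targets, batch_size):
    """Split predictions and targets into consecutive batches (last one may be smaller)."""
    return [
        (preds[i:i + batch_size], targets[i:i + batch_size])
        for i in range(0, len(targets), batch_size)
    ]


def count_correct(preds, targets):
    """Number of positions where prediction equals target."""
    return sum(p == t for p, t in zip(preds, targets))


def batch_averaged_accuracy(batches):
    """Mean of per-batch accuracies, i.e. dividing by the number of batches."""
    per_batch = [count_correct(p, t) / len(t) for p, t in batches]
    return sum(per_batch) / len(per_batch)


def sample_averaged_accuracy(batches):
    """Total correct predictions divided by total number of samples."""
    correct = sum(count_correct(p, t) for p, t in batches)
    total = sum(len(t) for _, t in batches)
    return correct / total


# 10 samples with batch size 4 gives batches of size 4, 4 and 2.
targets = [1] * 10

# Order A: the two wrong predictions land in the small last batch.
preds_a = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
# Order B: same predictions, "shuffled" so the wrong ones land in the first batch.
preds_b = [0, 0, 1, 1, 1, 1, 1, 1, 1, 1]

batches_a = make_batches(preds_a, targets, batch_size=4)
batches_b = make_batches(preds_b, targets, batch_size=4)

biased_a = batch_averaged_accuracy(batches_a)   # (1.0 + 1.0 + 0.0) / 3
biased_b = batch_averaged_accuracy(batches_b)   # (0.5 + 1.0 + 1.0) / 3
exact_a = sample_averaged_accuracy(batches_a)   # 8 / 10
exact_b = sample_averaged_accuracy(batches_b)   # 8 / 10

print(f"batch-averaged:  A={biased_a:.4f}  B={biased_b:.4f}")
print(f"sample-averaged: A={exact_a:.4f}  B={exact_b:.4f}")
```

The batch-averaged value depends on which samples fall into the smaller batch, while the sample-averaged value stays the same for any order. In a real validation loop the equivalent would be to accumulate the number of correct predictions and divide by len(loader.dataset) instead of len(loader).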
Thank you very much, I solved the problem!
Yes, I’m shuffling the DataLoader, and my dataset is not divisible by the batch size.
So that explains the small differences in the validation accuracy.