Different results on CUDA and CPU

It seems that the problem is related to prediction (not to model training):

  1. I trained the model on CUDA
  2. Saved the model to file
  3. Loaded it on CPU
  4. And I see the same problem: class ‘5’ for first images