The model is not learning

tony2037 (Ztex) September 22, 2019, 1:42am 3

I think I’ve found the problem.
According to: Why "loss.backward()" didn't update parameters' gradient?
Add a batch normalization layer could solve this.