Hi! I was working on an ASR model in tensorflow keras, but now I want to swich to pytorch. I’m trying to reimplement keras model in pytorch, but I think, I did a mistake, because the same model on the same data does not learn in pytorch.
Here is a full jupyter notebook of my problem: notebook on github
As You can see, the TF model overfits the random data, as expected, but the pytorch model does not learn anything.
I’m using pytorch 1.1.0 with CUDA, and Tensorflow 2.0.0-beta1