I have something super weird going on. I have two networks that are initialized with the same values but use different parameters. When I do a forward pass using the same data the networks have different results!
You can see this in this jupyter notebook https://github.com/blester125/WeirdNotebook/blob/master/PyTorchUnstable.ipynb
Does anyone know what is going on?