Weight initialization across releases

Javier_Grau · April 10, 2023, 1:02am

I’m not sure if this issue has been posted before, but here it goes.

I’ve been training networks on two different machines with same running parameters, but getting very different results. The only different is that one machine I use version 1.9.0 and in the other 1.13.0.

I’m wonder what could be so different between these two releases that is producing such effect. Is the default weight initialization the same on these two versions?

Thanks

marksaroufim · April 10, 2023, 2:43am

Are you observing these changes with the same seed across releases, for example can you repro your results if you do torch.manual_seed(0)

Javier_Grau · April 10, 2023, 9:56am

Thanks for your answer.

I can’t do that unfortunately. The hardware in both machines belong to two different generations of gpus, hence very different cuda versions. That’s why the different pytorch versions.
But I am manually setting the same seeds with torch.manual_seed.