Drastically different inference results on different machines?

You mentioned you’ve installed the same PyTorch versions and use the CPU now?
Make sure to use the same versions before staring to debug.

Also, I would recommend to use the lastest stable version (1.4) as well as Python3, as we’ve had a weird issue with Python2.7 recently (link).