Why does the same deep k-means code perform much worse in PyTorch than in TensorFlow?

I ran deep k-means in PyTorch, but the results are much worse than those of the original TensorFlow v1 implementation. How can I fix this?

The original DKM implementation: GitHub - MaziarMF/deep-k-means
My PyTorch implementation: GitHub - gaokun12/dkm_pytorch