How to transfer pretained model from Tensorflow to PyTorch?

yaluguo · August 13, 2017, 1:41pm

I have build exactly same model in both TF and Pytorch. And I trained in TF. For some reason, I have to transfer the pretrained weight to Pytorch.

The network is like:

In TF, Conv2d filter shape is [filter_height, filter_width, in_channels, out_channels], while in Pytorch is (out_channels, in_channels, kernel_size[0], kernel_size[1]).

So I have done below in TF:

and I transfer to pytorch like:

It turns out that the DQN in pytorch is not working well as in TF!

Sanchit_Chiplunkar · September 18, 2017, 12:10pm

Hey i wanna do the same kind of thing too did you find a solution. or is there a document from which i can take inspiration .
Thanks

smth · September 28, 2017, 3:46pm

See https://github.com/Cadene/tensorflow-model-zoo.torch for some hints / mechanisms.

gcucurull · January 27, 2018, 4:01pm

In case someone else gets here and has the same issue, I think that the problem is using reshape before transpose.
I have loaded TF weights with PyTorch by permuting the weight Tensor, and it worked fine.

Esam · July 17, 2018, 5:13pm

This was very helpful tip, thanks