I had to use tensorrt but the tensor unfold operation was not supported and the output of the converted model was rubbish.
So , I wrote the unfold with simple operations (forum thread) but it is extremely slow.
Do you think python’s for loops is causing this?