How to input time distributed images into Conv2d layer?

I like to tag RNN on top of a CNN. I am having problem with finding the right approach to pass this data through CNN to RNN. Right now , I reshape input output of CNN by merging seq and batch dimensions. Is there any more native way to do it in Pytorch ?

I think what I need is something like Timedstributed layer of Keras.