Is there a Pytorch implementation of the ‘cross convolution’ layer used in the Visual Dynamics paper? Specifically, I would like to implement a network that predicts a set of conv filters (i.e. their weights are the network output) that are later convolved with an image. I could imagine doing something like