Hi, I’m trying to apply CNN to each image in a video. Currently, my implementation uses a for loop and torch.cat where I take each image and apply the CNN module in the loop. But clearly, this is sequential and I don’t see why it can’t be parallelized in theory since all images are independent from each other.
However, I’m not sure how this can be accomplished. I couldn’t find any built-in function for PyTorch. Is there a way to use matrix multiplication for something like this?
My video input shape looks like this: (batch_size, seq_len, channel, height, width) and CNN takes input shape of (batch_size, channel, height, width).
Thanks in advance for your help!