Greetings. I am getting confused about the input shape to GRU layer.
I have one batch of 128 images and I extracted 9 features from each images.
So now my shape is
Here 1 is batch, 128 images in each batch and 9 features of each images. The images is in sequence, for example 128 frame of a video. So simply one batch represent one video.
This is the GRU layer
Question 1: Is the
input_size=128 correctly defined?
Here is the code of forward function
def forward(features): features=features.permute(0,2,1)#[1, 9, 128] x2,_=self.gru(features)
Question 2: Is the code in forward function is correctly defined?
Question 3: In shape
(1,128,9), 9 is sequence length and 128 is the input. Is this correct or its the opposite way?