I have tried many approaches but could not find a solution. I am implementing the GRU layer of a model in C.
I have extracted the weights and biases. Can you give me the equations to compute the gates and outputs?
I am using:
hidden_size = 32
num_layers = 1
batch_first = True
I followed the equations in the GRU section of https://pytorch.org/docs/stable/nn.html.
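
For reference, these are the per-time-step equations as given in the GRU section of the PyTorch docs, where x_t is the input at time t, h_{t-1} is the hidden state from the previous time step (zero at t = 0 if no initial state is passed), \sigma is the sigmoid, and \odot is the element-wise product:

$$
\begin{aligned}
r_t &= \sigma(W_{ir} x_t + b_{ir} + W_{hr} h_{t-1} + b_{hr}) \\
z_t &= \sigma(W_{iz} x_t + b_{iz} + W_{hz} h_{t-1} + b_{hz}) \\
n_t &= \tanh\bigl(W_{in} x_t + b_{in} + r_t \odot (W_{hn} h_{t-1} + b_{hn})\bigr) \\
h_t &= (1 - z_t) \odot n_t + z_t \odot h_{t-1}
\end{aligned}
$$

Note that b_{hn} sits inside the parentheses multiplied by r_t, so the two n-gate biases cannot be merged into one.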
My expected output is 151x32.
Only the first 1x32 row of my output matches the expected output; the remaining 150x32 rows do not match.
I do not understand what a time step corresponds to in the implementation.
If possible, please suggest any documentation on implementing a GRU in testing (inference) mode.