GRU with different input_size and hidden_size

alishir · December 22, 2017, 6:29am

Is it possible to have GRU with different input_size and hidden_size?
According to documents x_t is the input for the first layer or the hidden state of the previous layer.

alishir · December 22, 2017, 6:49am

Unfortunately I didn’t read the documents carefully.
Yes, it is possible. Inputs flow in the network from first layer and for the next layer, output of first layer used as input.