Initial hidden state of GRU when testing the real data

Hou_ZeYu · December 24, 2020, 3:45am

I try to understand how GRU work in the model.
I already read the GRU document.
I think I known how it work in the forward process.
Here is my example for demo forward follow blow.
001
002

For each input data, I need to initialize a hidden state.
In training process, I can randomly initialize it.
After finishing training process, if I want to test a real data, how to determine the initial hidden state?
Still by randomly choose? Or I make some mistake about forward of GRU?

Abhilash_Srivastava · December 24, 2020, 7:59am

Use the same method for both train and test (random initialization in this case).

Hou_ZeYu · December 24, 2020, 9:04am

The same input data, different initial state, will get the different result, is it normal?

Abhilash_Srivastava · December 25, 2020, 1:44am

Can you elaborate on what you exactly mean?

Did you mean that the same model after training is used with different initial states?
Did you mean training with different initial states?