Hi,
I extracted the GRU weights from the trained model and followed the instructions in pytorch document.
But, while comparing the results of model’s GRU outputs and c++ GRU output precision is missing by 0.1(maximum of all results). Can you suggest me to overcome this.
Advance thanks.
yf225
(PyTorch Dev, Facebook AI Research)
July 21, 2019, 8:18pm
#2
Could you post the full Python and C++ code for your experiment?
yf225
(PyTorch Dev, Facebook AI Research)
July 21, 2019, 8:29pm
#3
Yes, it is resolved. I made an small error in matmuls.
Thanks,
M. Dinesh
Can you give me suggestions on my current problem I am facing with bidirectional multilayered GRU?
[35610748-dd41a23a-06a5-11e8-803e-c577359c372b]
As per my understanding, GRU pytorch working as in the 2nd picture. After each layer, both forward and reverse directiona outputs are concatenated. Correct me if i was wrong.
The initial hidden state of first layer initialised with zeroes.
Is The last hidden state of 1st layer is the initial hidden state of the 2nd layer?.
And also I want to know, the second layer takes the concatenated 1st layer output as a input and operates on them, as …