I extracted the GRU weights from the trained model and followed the instructions in the PyTorch documentation.
However, when I compare the model's GRU outputs with my C++ GRU outputs, they differ by up to 0.1 (the maximum over all results). Can you suggest how to fix this?
Could you post the full Python and C++ code for your experiment?
Yes, it is resolved. I made a small error in the matmuls.
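For anyone hitting a similar mismatch, here is a minimal sketch of a single GRU time step in plain C++, written against the gate equations given in the PyTorch `nn.GRU` documentation. It is not the code from my experiment. It assumes the weights are exported as flat row-major arrays with the three gate blocks stacked in the order reset, update, new (the layout of `weight_ih_l0` and `weight_hh_l0`), and the names `Vec`, `affine`, and `gru_step` are made up for illustration. It uses `double`, whereas a PyTorch model normally runs in float32, so small rounding differences are expected; a gap as large as 0.1 more likely points to a matmul or gate-ordering mistake.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

using Vec = std::vector<double>;

// Row-major affine map: y = W x + b, where W holds rows * cols entries.
Vec affine(const Vec& W, const Vec& b, const Vec& x,
           std::size_t rows, std::size_t cols) {
    assert(W.size() == rows * cols && b.size() == rows && x.size() == cols);
    Vec y(b);  // start from the bias
    for (std::size_t i = 0; i < rows; ++i)
        for (std::size_t j = 0; j < cols; ++j)
            y[i] += W[i * cols + j] * x[j];
    return y;
}

double sigmoid(double v) { return 1.0 / (1.0 + std::exp(-v)); }

// One GRU time step for a single layer and direction.
// w_ih is (3*H x I), w_hh is (3*H x H), b_ih and b_hh have 3*H entries.
// Gate blocks are assumed stacked as [reset | update | new].
Vec gru_step(const Vec& w_ih, const Vec& w_hh,
             const Vec& b_ih, const Vec& b_hh,
             const Vec& x, const Vec& h) {
    const std::size_t H = h.size();
    const std::size_t I = x.size();
    const Vec gi = affine(w_ih, b_ih, x, 3 * H, I);  // input contribution
    const Vec gh = affine(w_hh, b_hh, h, 3 * H, H);  // hidden contribution
    Vec h_next(H);
    for (std::size_t k = 0; k < H; ++k) {
        const double r = sigmoid(gi[k] + gh[k]);          // reset gate
        const double z = sigmoid(gi[H + k] + gh[H + k]);  // update gate
        // The reset gate scales only the hidden part, bias b_hn included.
        const double n = std::tanh(gi[2 * H + k] + r * gh[2 * H + k]);
        h_next[k] = (1.0 - z) * n + z * h[k];
    }
    return h_next;
}
```

To process a sequence, call `gru_step` once per time step and feed each returned state back in as `h`. Comparing the output of the first step against PyTorch before looking at later steps makes it easier to tell a weight-layout error from an accumulation error.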
Can you give me suggestions on the problem I am currently facing with a bidirectional multilayered GRU?