Thank you!
And Yes, your answer has be proved by my experiments.
I manually add a dropout layer after lstm, and it works well.
I have been stucked in this bug for a long time! the same data and the same config, it’s always overfitting with the pytorch version. Finally! Thanks!