I want to compute higher-order gradients through an LSTMCell, but this is not supported. So I want to implement the cell myself with linear layers, and initialize it from a well-trained LSTMCell.

However, I cannot find the order of the gates. The following is my implementation. What is the order of the gates in the PyTorch LSTMCell implementation? Thank you.

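The order of the gates within the parameters is the same as for LSTM, so you can consult the Variables section in the LSTM documentation. There the stacked weights are listed as (W_ii|W_if|W_ig|W_io), i.e. input gate, forget gate, cell gate, output gate, each block having hidden_size rows.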
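To make that concrete, here is a minimal sketch of a cell built from two nn.Linear layers and initialized from an existing nn.LSTMCell. The class name LinearLSTMCell and the method load_from are made up for illustration, and the sizes are arbitrary; it assumes the cell was created with bias=True. The comparison at the end is only a sanity check that the gate order is right.

```python
import torch
import torch.nn as nn


class LinearLSTMCell(nn.Module):
    """LSTM cell built from nn.Linear layers (hypothetical name, illustration only)."""

    def __init__(self, input_size, hidden_size):
        super().__init__()
        # Each layer produces all four gates at once, stacked as i, f, g, o.
        self.ih = nn.Linear(input_size, 4 * hidden_size)
        self.hh = nn.Linear(hidden_size, 4 * hidden_size)

    def load_from(self, cell):
        # nn.LSTMCell stores weight_ih / weight_hh with shape (4 * hidden_size, in_features),
        # which matches the (out_features, in_features) layout of nn.Linear.
        with torch.no_grad():
            self.ih.weight.copy_(cell.weight_ih)
            self.ih.bias.copy_(cell.bias_ih)
            self.hh.weight.copy_(cell.weight_hh)
            self.hh.bias.copy_(cell.bias_hh)

    def forward(self, x, state):
        h, c = state
        gates = self.ih(x) + self.hh(h)
        # Split in the documented order: input, forget, cell, output.
        i, f, g, o = gates.chunk(4, dim=1)
        i, f, o = torch.sigmoid(i), torch.sigmoid(f), torch.sigmoid(o)
        g = torch.tanh(g)
        c_next = f * c + i * g
        h_next = o * torch.tanh(c_next)
        return h_next, c_next


torch.manual_seed(0)
reference = nn.LSTMCell(3, 5)   # stands in for the well-trained cell
custom = LinearLSTMCell(3, 5)
custom.load_from(reference)

x = torch.randn(2, 3, requires_grad=True)
h0, c0 = torch.randn(2, 5), torch.randn(2, 5)

h_ref, c_ref = reference(x, (h0, c0))
h_new, c_new = custom(x, (h0, c0))
outputs_match = (torch.allclose(h_ref, h_new, atol=1e-5)
                 and torch.allclose(c_ref, c_new, atol=1e-5))

# Second-order gradient through the custom cell.
(g1,) = torch.autograd.grad(h_new.sum(), x, create_graph=True)
(g2,) = torch.autograd.grad(g1.sum(), x)
```

Since the custom cell uses only ordinary differentiable ops (linear, sigmoid, tanh, elementwise products), passing create_graph=True to the first grad call should be enough to differentiate a second time.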

Best regards

Thomas