How to implement a different version of BiLSTM

KiroSummer · March 10, 2018, 4:49am

Hello, everyone
I am a newer of Pytorch. I have a question, Pytorch’s BiLSTM is the structure that take the same input and run forward and reversed direction respectively. And then concatenate the two output of the forward and reversed direction LSTM as the BiLSTM’s output. Just as the picture below shows:

My question is, does the Pytorch support another BiLSTM has the structure that: the reversed LSTM take the forward LSTM’s output as it’s input. Just as the picture shows:

If the pytorch doesn’t support this kind of structure, how can i implement it myself? And how to support the use of pad_packed_sequence in Pytorch for batching.

thanks for your help!

jpeg729 · March 10, 2018, 10:49am

PyTorch LSTM has a bidirectional option. I haven’t used it so I am not sure how well it works.

KiroSummer · March 10, 2018, 10:54am

Pytorch LSTM’s structure is the above one. And I want a BiLSTM that use the below structure which different from the Pytorch LSTM. Thanks anyway.

jpeg729 · March 10, 2018, 10:56am

Your model could compute one layer at a time and reverse the output along the time dimension after each layer.
But supporting padded packed sequences will add a little complexity

KiroSummer · March 10, 2018, 11:02am

What I should do is reverse the actual length output (doesn’t contain the padding’s output). Is that right?