Does a stacked LSTM share a weight matrix?

If I’m using an LSTM and set num_layers to some number other than 1, do all layers share the same weights?

No, each layer has its own weights and biases.
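
You can see this by listing the module's parameters: a stacked LSTM exposes a separate weight_ih_l{k} / weight_hh_l{k} pair per layer. A quick sketch (the sizes here are just placeholders, not from your model):

import torch.nn as nn

# a 2-layer stacked LSTM; sizes are placeholders
lstm = nn.LSTM(input_size=10, hidden_size=20, num_layers=2)

# prints weight_ih_l0 / weight_hh_l0 for layer 0 and
# weight_ih_l1 / weight_hh_l1 for layer 1, each with its own biases
for name, param in lstm.named_parameters():
    print(name, tuple(param.shape))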


Is it possible to get a different hidden size for each layer?

The easiest way would be to create separate LSTMs, as in the example below:

# the second LSTM's input_size must match the first's hidden_size
first = nn.LSTM(input_size=input_size1, hidden_size=hidden_size1, num_layers=num_layers1)
second = nn.LSTM(input_size=hidden_size1, hidden_size=hidden_size2, num_layers=num_layers2)
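
In the forward pass you then feed the full output sequence of the first LSTM into the second. A rough, self-contained sketch with placeholder sizes (assuming the default batch_first=False layout):

import torch
import torch.nn as nn

# placeholder sizes, just to show the wiring
input_size1, hidden_size1, hidden_size2 = 10, 20, 30

first = nn.LSTM(input_size=input_size1, hidden_size=hidden_size1, num_layers=1)
second = nn.LSTM(input_size=hidden_size1, hidden_size=hidden_size2, num_layers=1)

x = torch.randn(5, 3, input_size1)   # (seq_len, batch, input_size1)
out1, _ = first(x)                    # out1: (seq_len, batch, hidden_size1)
out2, (h_n, c_n) = second(out1)       # out2: (seq_len, batch, hidden_size2)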

Seems straightforward enough. Thank you. I wonder if there’s any advantage to doing this?