I am new to LSTM and PyTorch’s implementation of LSTM using
torch.nn.LSTM() has confused me further.
I am implementing an LSTM model for predicting the speeds of different frames (https://arxiv.org/pdf/1801.06734.pdf).
The paper does not mention much about the LSTM part of the model. All it says is that I have to consider the “speeds of 10 previous timestamps”. Also, in the image of the architecture, they have mentioned “LSTM 128”.
What should my input_size, hidden_size and num_layers be? What is “128” here?
Since I have essentially only one feature, i.e., speed, I am guessing the input_size to be 1. Is this correct?
I can have the seq_len to be 10 (for 10 previous timestamps).
Is this correct or am I missing something here?
If that is correct, what will the hidden_size, batch_size and num_layers be?