Convolutional LSTM

rogetrullo · April 11, 2017, 2:31am

Hi guys, I have been working on an implementation of a convolutional lstm.
I implemented first a convlstm cell and then a module that allows multiple layers.
Here’s the code:

github.com

rogertrullo/pytorch_convlstm/blob/master/conv_lstm.py

import torch.nn as nn
from torch.autograd import Variable
import torch

def weights_init(m):
    classname = m.__class__.__name__
    if classname.find('Conv') != -1:
        m.weight.data.normal_(0.0, 0.02)
    elif classname.find('BatchNorm') != -1:
        m.weight.data.normal_(1.0, 0.02)
        m.bias.data.fill_(0)

class CLSTM_cell(nn.Module):
    """Initialize a basic Conv LSTM cell.
    Args:
      shape: int tuple thats the height and width of the hidden states h and c()
      filter_size: int that is the height and width of the filters
      num_features: int thats the num of channels of the states, like hidden_size
      
    """

This file has been truncated. show original

It’d be nice if anybody could comment about the correctness of the implementation, or how can I improve it.
Thanks!

melody-rain · April 11, 2017, 2:53am

there is no need to implement LSTM by yourselt. In your forward() of CLSTM_cell, just input the output of conv to nn.LSTM. Like the code below:

        x = self.CNN(x)
        x = x.view(x.size()[0], 512, -1)
        # (batch, input_size, seq_len) -> (batch, seq_len, input_size)
        x = x.transpose(1, 2)
        # (batch, seq_len, input_size) -> (seq_len, batch, input_size)
        x = x.transpose(0, 1).contiguous()
        x, _ = self.LSTM1(x)

You should define your self.LSTM1 in your init, like

self.BiLSTM1 = nn.LSTM(input_size=nIn, hidden_size=nHidden, num_layers=1, dropout=0)

Also refer to the definition of nn.LSTM for how to use.

rogetrullo · April 11, 2017, 3:00am

Hi,
I think this is different, I am trying to do something similar to what is presented in this paper.
Here the input is an image, and the states are also multichannel images. The input to hidden and hidden to hidden operation are convolutions instead of matrix vector multiplications…
In your code you just convert the output of a CNN to a vector and use the regular LSTM.

hongyuan · April 11, 2017, 7:42am

yeah, I am agree with you. Have you test that? Dose it works? I think LSTM may have too many parameters, GRU may works better?

alan_ayu · September 5, 2017, 3:37pm

Hi !
You have done a great work, I am also interested in CLSTM and want to do something using it.
I don’t know how it run in your machine, but I can’t run your code directly, so I rewrite some parts and it can run well with these changes, I changed the loop in CLSTM.forward to:

for idlayer in xrange(self.num_layers):
    hidden_c=hidden_state[idlayer]
output_inner=[]

for t in xrange(seq_len):
    hidden_c=self.cell_list[idlayer](current_input[:,t,:,:,:],hidden_c)
    output_inner.append(hidden_c[0].unsqueeze(1))

next_hidden.append(hidden_c)
current_input=torch.cat(output_inner,1)

Does these changes conflict with your original intension?

rogetrullo · September 9, 2017, 4:37pm

Hi alan, could you tell me whats the error you are having with the original code?
I will check your changes to see if they do the same.

alan_ayu · September 10, 2017, 1:38pm

The most obvious error is that the features map size are not compatible,for example I can’t use torch.cat to concatenate input image and hidden states successfully.

rogetrullo · September 10, 2017, 4:27pm

Thanks @alan_ayu!
There was indeed an error in the input format (batch, seq_len,…). It happened because I used the right format in my own code, and I put a wrong one in GitHub. Could you please check again? Let me know if you still have any issues.

QuantScientist · September 10, 2017, 7:54pm

There is also this model:

github.com

Atcold/pytorch-CortexNet/blob/master/model/ConvLSTMCell.py

import torch
from torch import nn
import torch.nn.functional as f
from torch.autograd import Variable


# Define some constants
KERNEL_SIZE = 3
PADDING = KERNEL_SIZE // 2


class ConvLSTMCell(nn.Module):
    """
    Generate a convolutional LSTM cell
    """

    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.input_size = input_size
        self.hidden_size = hidden_size

This file has been truncated. show original

alan_ayu · September 13, 2017, 8:07am

The net now can work well !!!