Initialize hidden layer in RNN network

mattinjersey · August 13, 2018, 2:00pm

Hello, I have read similar topics on initializing the hidden layer in an RNN.
However, I find them quite confusing.
Right now I have the following code, which initializes the hidden state with zeros. Could you explain how to modify it so that the initial state is learned during training? Thanks, Matt

def initHidden(self):
    return torch.zeros((self.MiniBatchSize, self.HiddenNodes), dtype=torch.double)

al3x · August 13, 2018, 2:18pm

Do you mean you want to treat your initial hidden state as a learnable parameter? Wrap it in nn.Parameter:

class RNN(nn.Module):
    def __init__(self, ...):
        ...
        h_0 = torch.zeros((self.MiniBatchSize, self.HiddenNodes),
                          dtype=torch.double)
        self.h_0 = nn.Parameter(h_0)
        ...
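As a minimal sketch of the same idea with the built-in `nn.RNN` (an assumption, since the thread does not say which recurrent layer is used), the initial state can be stored as a single learnable row and expanded to whatever batch size arrives, so the parameter does not depend on `MiniBatchSize`. The class name `LearnedInitRNN` and all sizes are hypothetical.

```python
import torch
import torch.nn as nn


class LearnedInitRNN(nn.Module):
    """Hypothetical wrapper: nn.RNN with a trainable initial hidden state."""

    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.rnn = nn.RNN(input_size, hidden_size, batch_first=True)
        # One learnable row, shape (num_layers, 1, hidden_size).
        self.h_0 = nn.Parameter(torch.zeros(1, 1, hidden_size))

    def forward(self, x, hidden=None):
        if hidden is None:
            # Broadcast the learned row across the current batch.
            hidden = self.h_0.expand(-1, x.size(0), -1).contiguous()
        return self.rnn(x, hidden)


model = LearnedInitRNN(input_size=3, hidden_size=5)
out, h_n = model(torch.randn(4, 7, 3))  # batch of 4, 7 time steps
out.sum().backward()                    # gradients reach model.h_0
```

Because `h_0` is registered as an `nn.Parameter`, it shows up in `model.parameters()` and is updated by the optimizer like any other weight.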

mattinjersey · August 13, 2018, 2:20pm

Will I still call initHidden between minibatches?

al3x · August 13, 2018, 2:30pm

You wouldn’t want to, at least not if you are using a stateful RNN, where you pass in the hidden state output from the previous minibatch.
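A rough sketch of what "stateful" could look like in practice, assuming the built-in `nn.RNN` and dummy data (both are assumptions, not taken from the thread): the hidden state returned for one minibatch is fed into the next, and it is detached in between so that backpropagation stops at the minibatch boundary.

```python
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=3, hidden_size=5, batch_first=True)
hidden = None  # nn.RNN treats None as an all-zero initial state

for _ in range(3):                # three dummy minibatches
    x = torch.randn(4, 7, 3)      # (batch, steps, features)
    rnn.zero_grad()
    out, hidden = rnn(x, hidden)  # carry the state over from the last batch
    out.sum().backward()
    # detach() returns a new tensor, so the result has to be reassigned.
    hidden = hidden.detach()
```

Without the reassignment, the second call to `backward()` would try to go through the graph of the previous minibatch, which has already been freed.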

mattinjersey · August 13, 2018, 2:54pm

Thanks- I’ll try the approach!

mattinjersey · August 13, 2018, 3:18pm

How will I call the function? Previously I wrote:
hidden = aModel.initHidden()
outA, hidden = aModel(input, hidden)

mattinjersey · August 13, 2018, 3:37pm

Maybe I don’t call it anymore and just use self.hidden internally, calling outA=model(Input).
I think I still need the detach call: between minibatches I would call model.hidden_0.detach()

al3x · August 13, 2018, 8:40pm

Something like this perhaps?

def forward(self, data, hidden):
    # "is None" avoids an elementwise comparison when hidden is a tensor
    if hidden is None:
        hidden = self.h_0
    ...

mattinjersey · August 13, 2018, 8:59pm

Could you review the entire code (pseudocode)?

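class RNN(nn.Module):
    def __init__(self, ...):
        ...
        h_0 = torch.zeros((self.MiniBatchSize, self.HiddenNodes),
                          dtype=torch.double)
        self.h_0 = nn.Parameter(h_0)
        # Both layers take the concatenated [data, hidden] vector as input;
        # Lin2 must output HiddenNodes values to produce the next hidden state.
        self.Lin1 = nn.Linear(InNodes + self.HiddenNodes, NumNodes)
        self.Lin2 = nn.Linear(InNodes + self.HiddenNodes, self.HiddenNodes)

    def forward(self, data, hidden):
        if hidden is None:
            hidden = self.h_0
        # Concatenate along the feature dimension, not the batch dimension
        aDat = torch.cat((data, hidden), dim=1)
        kOut = self.Lin1(aDat)
        hidden = self.Lin2(aDat)
        return kOut, hidden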

aModel = RNN()
hidden = None
while True:
    aData, aTruth = Minibatch
    for count in range(numSteps):
        kOut, hidden = aModel(aData[count], hidden)
    loss = criterion(kOut, aTruth)
    # detach() is not in-place: reassign the result
    hidden = hidden.detach()
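For comparison, here is a self-contained sketch of the same cell-style design that actually runs. Everything not in the pseudocode above is an assumption: the class name `CellRNN`, the layer names, the `tanh` on the hidden update, the sizes, the dummy data, and the choice of SGD with MSE loss. It also resets `hidden` to `None` at the start of every minibatch, so the learned `h_0` is used (and receives a gradient) each time. In the stateful loop above, `hidden` is `None` only on the first pass, so `h_0` would only be trained on the first minibatch.

```python
import torch
import torch.nn as nn


class CellRNN(nn.Module):
    """Hypothetical cell: two linear layers over [data, hidden]."""

    def __init__(self, in_nodes, hidden_nodes, out_nodes):
        super().__init__()
        # Learnable initial state, one row shared by every sequence.
        self.h_0 = nn.Parameter(torch.zeros(1, hidden_nodes))
        self.lin_out = nn.Linear(in_nodes + hidden_nodes, out_nodes)
        self.lin_hid = nn.Linear(in_nodes + hidden_nodes, hidden_nodes)

    def forward(self, data, hidden=None):
        if hidden is None:
            hidden = self.h_0.expand(data.size(0), -1)
        joined = torch.cat((data, hidden), dim=1)
        return self.lin_out(joined), torch.tanh(self.lin_hid(joined))


torch.manual_seed(0)
model = CellRNN(in_nodes=3, hidden_nodes=5, out_nodes=2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.MSELoss()
h_0_before = model.h_0.detach().clone()

for _ in range(2):                  # two dummy minibatches
    seq = torch.randn(7, 4, 3)      # (steps, batch, features)
    target = torch.randn(4, 2)
    hidden = None                   # restart from the learned h_0
    for step in range(seq.size(0)):
        out, hidden = model(seq[step], hidden)
    loss = criterion(out, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()                # updates h_0 along with the weights
```

Comparing `h_0_before` with `model.h_0` after the loop is a quick way to check that the initial state is really being trained.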

mattinjersey · August 13, 2018, 9:01pm

I don’t know how to format the code on this webpage.

ptrblck · August 13, 2018, 9:46pm

I’ve formatted your code.
You can add code with three backticks before and after the code block (```).

mattinjersey · August 13, 2018, 9:49pm

I have to say people on this forum are quite delightful.