I’m trying to convert the following simple model from Python to C++. The training loop runs, but I suspect I’m not handling the hidden state correctly, because I’m not getting good results. Would someone mind checking my code?
Python:
class Net(nn.Module):
    """Stacked RNN (LSTM or GRU) followed by a per-timestep linear head.

    Args:
        featurelen: number of input features per timestep.
        outputlen:  number of outputs per timestep.
        hwidth:     hidden width of the RNN (defaults to 12 when None).
        nhidden:    number of stacked RNN layers.
        rnntype:    'lstm' or 'gru' — selects the recurrent cell class.
    """

    def __init__(self, featurelen, outputlen, hwidth=None, nhidden=4, rnntype='lstm'):
        super(Net, self).__init__()
        self.hidden_width = 12 if hwidth is None else hwidth
        self.nhidden = nhidden
        # Last hidden state returned by the RNN; stored for inspection only,
        # it is never fed back into the next forward pass.
        self.hidden = None
        self.rnn1 = {'lstm': nn.LSTM, 'gru': nn.GRU}[rnntype](
            featurelen,
            self.hidden_width,
            num_layers=self.nhidden,
            batch_first=True)
        self.dense1 = nn.Linear(self.hidden_width, outputlen)

    def forward(self, x):
        # Note: rnn1 is called WITHOUT an initial hidden state, so every
        # forward pass starts from zeros. The returned hidden state is
        # stored but not reused — this is the behavior the C++ port must match.
        x, self.hidden = self.rnn1(x)
        # The linear layer is applied to every timestep (x is
        # (batch, seq, hidden_width) because batch_first=True).
        return self.dense1(x)
And in C++:
// C++ port of the Python `Net` (GRU variant): stacked GRU + per-timestep
// linear head.
struct myNet : torch::nn::Module {
  myNet(int input_size, int output_size,
        int hidden_width = 12,
        int recursive_layers = 4) {
    recurrent = register_module("recurrent",
        torch::nn::GRU(
            torch::nn::GRUOptions(input_size, hidden_width)
                .num_layers(recursive_layers)
                .batch_first(true)));
    output = register_module("output",
        torch::nn::Linear(hidden_width, output_size));
  }

  torch::Tensor forward(torch::Tensor x) {
    // FIX: the Python model calls `self.rnn1(x)` with NO initial hidden
    // state, so every forward pass starts from zeros. The previous C++ code
    // passed the stored `hidden` back in, which (a) diverges from the Python
    // model and (b) keeps the autograd graph of the previous batch alive
    // through `hidden`, so during training backward() either fails
    // ("trying to backward through the graph a second time") or leaks state
    // across batches. Omitting the second argument uses the default
    // (zero) initial hidden state, matching Python.
    std::tie(x, hidden) = recurrent->forward(x);
    // As in Python, the linear layer is applied to every timestep
    // (x is (batch, seq, hidden_width) because batch_first=true).
    x = output->forward(x);
    return x;
  }

  // Last hidden state, stored for inspection only — never fed back in.
  // If you later want truncated BPTT (carrying state across batches),
  // store `hidden.detach()` and pass it as the second forward() argument.
  torch::Tensor hidden;
  /*
   * See https://pytorch.org/tutorials/advanced/cpp_frontend.html
   * for why nullptr.
   */
  torch::nn::GRU recurrent{nullptr};
  torch::nn::Linear output{nullptr};
};