[Solved] Character-Level RNN generate names?

danche354 · June 11, 2017, 10:35am

In the tutorial http://pytorch.org/tutorials/intermediate/char_rnn_generation_tutorial.html. Why the same input could produces different output? After training, the model parameters supposed to be fixed, isn’t it?

spro · June 11, 2017, 9:14pm

It looks like you solved it already but the reason this happens is the dropout layer adding some randomness. The dropout could be “turned off” to make the model deterministic with rnn.train(False) in the sample function.

huanghuang · September 5, 2017, 8:09am

Hi, spro, I’m a begginner of DL and Pytorch, I cannt understand why the net graph in http://pytorch.org/tutorials/intermediate/char_rnn_generation_tutorial.html is that, I donnt think the previous output is the next step input, could you tell me why? Thanks!

spro · September 5, 2017, 11:53pm

When generating, the previous output is the next input, however when training, the correct output is the next input. This training technique is known as “teacher forcing” - look that up for more on why it’s used.

huanghuang · September 6, 2017, 9:05am

Thanks very much for your explanation!