Maybe some errors in the seq2seq translation tutorial

https://pytorch.org/tutorials/intermediate/seq2seq_translation_tutorial.html
I don’t think we should change the shape of the input; just keep it as (seq, batch).

The input to nn.GRU is defined as [seq_len, batch, input_size], which is why the view is performed.

But the shape is [1, 1, -1], which would mean [1, 1, seq*batch*hidden]. This makes no sense to me.

On Saturday, March 9, 2019, ptrblck via PyTorch Forums noreply@discuss.pytorch.org wrote:

The view operation reshapes the tensor to [1, 1, "remaining size"].
In case your tensor has 10 elements, embedded will be reshaped to [1, 1, 10].
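A minimal sketch of this reshape (the tensor size of 10 is just the example from above; `-1` tells `view` to infer the remaining dimension):

```python
import torch

# A tensor with 10 elements, standing in for the embedded word vector
embedded = torch.randn(10)

# view(1, 1, -1) keeps the first two dims as 1 and infers the rest
reshaped = embedded.view(1, 1, -1)
print(reshaped.shape)  # torch.Size([1, 1, 10])
```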

So the size is 1 * 1 * seq_len * batch_size * hidden_size, and then the tensor will be fed into the GRU. But the GRU only accepts tensors of size (seq_len, batch, input_size).

Basically yes, but since in the tutorial each word is fed one by one, seq_len and batch_size will both be 1.
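To make this concrete, here is a sketch of a single encoder step with one word and batch size 1 (the vocabulary size of 1000 and `hidden_size` of 256 are arbitrary choices for illustration, not the tutorial's exact values):

```python
import torch
import torch.nn as nn

hidden_size = 256  # arbitrary size for illustration
embedding = nn.Embedding(1000, hidden_size)
gru = nn.GRU(hidden_size, hidden_size)

word_index = torch.tensor([5])        # a single word index, so seq_len = batch = 1
embedded = embedding(word_index)      # shape [1, hidden_size]

# Reshape to [seq_len, batch, input_size] = [1, 1, hidden_size];
# -1 infers hidden_size, so nothing is flattened across seq or batch
gru_input = embedded.view(1, 1, -1)

hidden = torch.zeros(1, 1, hidden_size)
output, hidden = gru(gru_input, hidden)
print(output.shape)  # torch.Size([1, 1, 256])
```

Since the word is fed in alone, the first two dimensions really are both 1, and `-1` only ever absorbs `hidden_size`.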
