@ndronen @smth Do you think it should be like this one? suppose that I want to update the weight of CNN during LSTM training.
Because some peoples said it should be like this (for example done in keras).
Is there something like TimeDistributed
in Pytorch?, or we shouldn’t do it anymore because figure 1 is true