[Pytorch] Sorry for my terrible drawing. But according to the above figure, I want to use 2 separate losses to update different modules in the entire network. The data flow is illustrated with arrows in the above figure, I hope to define a loss1 to only update “trainable 1”, and another term loss2…

When you do backward pass, some of the values of the graph for the intermediate (non-leaf) nodes are freed. So, second pass over same nodes will trigger this error. To be able to proceed with second pass, you need to add parameter to loss1.backward(retain_graph=True)

How to handle two separate optimizers and separate losses?

vision

Programmer-RD-AI (Programmer-RD-AI) October 26, 2021, 7:53am 2

check the above discussions

1 Like