Hi all, I have two distinct networks that share the same loss, and a separate optimizer for each network. How can I backpropagate the loss to both networks?
Any help would be appreciated.
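
For reference, here is a minimal sketch of one way to do this, assuming PyTorch and assuming the shared loss is computed from the outputs of both networks. The names `net_a`, `net_b`, `opt_a`, `opt_b`, the layer sizes, and the way the outputs are combined are all placeholders, not taken from the question. A single `loss.backward()` call fills in gradients for every parameter that contributed to the loss, and each optimizer then updates only the parameters it was given.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Two hypothetical, independent networks (sizes are placeholders).
net_a = nn.Linear(4, 1)
net_b = nn.Linear(4, 1)

# One optimizer per network; the algorithms and learning rates are arbitrary.
opt_a = torch.optim.SGD(net_a.parameters(), lr=0.1)
opt_b = torch.optim.Adam(net_b.parameters(), lr=0.001)

# Dummy batch, only for illustration.
x = torch.randn(8, 4)
target = torch.randn(8, 1)

# Clear gradients left over from any previous step.
opt_a.zero_grad()
opt_b.zero_grad()

# Assumption: the shared loss depends on both outputs (here, their sum).
loss = nn.functional.mse_loss(net_a(x) + net_b(x), target)

# One backward pass computes gradients for the parameters of both networks.
loss.backward()

# Each optimizer updates only its own network.
opt_a.step()
opt_b.step()
```

If the loss is built differently (for example, one network feeding into the other), the same pattern should still apply as long as both networks are part of the graph that produces `loss`.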

There is also the option of creating parameter groups within a single optimizer, but I think that is equivalent to creating two separate optimizers with different learning rates etc. This should hold even for optimization algorithms like Adam that keep running statistics, since those statistics are tracked per parameter.
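
A rough way to check this claim, again assuming PyTorch: the sketch below trains two identical copies of a pair of networks, one copy with two separate Adam optimizers and the other with a single Adam optimizer that has two parameter groups, and then compares the resulting weights. All names, sizes, and learning rates are made up for illustration.

```python
import copy
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical networks, plus identical copies for the second setup.
net_a, net_b = nn.Linear(4, 1), nn.Linear(4, 1)
net_a2, net_b2 = copy.deepcopy(net_a), copy.deepcopy(net_b)

# Setup 1: two separate optimizers with different learning rates.
opt_a = torch.optim.Adam(net_a.parameters(), lr=1e-2)
opt_b = torch.optim.Adam(net_b.parameters(), lr=1e-3)

# Setup 2: one optimizer with one parameter group per network.
opt_joint = torch.optim.Adam([
    {"params": net_a2.parameters(), "lr": 1e-2},
    {"params": net_b2.parameters(), "lr": 1e-3},
])

# Dummy data shared by both setups.
x, target = torch.randn(16, 4), torch.randn(16, 1)

for _ in range(5):
    # Setup 1: one backward pass, then step both optimizers.
    opt_a.zero_grad()
    opt_b.zero_grad()
    loss = nn.functional.mse_loss(net_a(x) + net_b(x), target)
    loss.backward()
    opt_a.step()
    opt_b.step()

    # Setup 2: same loss on the copies, single optimizer step.
    opt_joint.zero_grad()
    loss2 = nn.functional.mse_loss(net_a2(x) + net_b2(x), target)
    loss2.backward()
    opt_joint.step()

# Compare the parameters of the two setups pairwise.
pairs = zip(
    list(net_a.parameters()) + list(net_b.parameters()),
    list(net_a2.parameters()) + list(net_b2.parameters()),
)
same = all(torch.allclose(p, q) for p, q in pairs)
print("parameters match:", same)
```

This only covers plain Adam with per-group learning rates on a toy problem; anything that acts across all parameters of an optimizer at once would need to be checked separately.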