Two networks with one loss. How is it possible?

Hi all, I have two distinct networks that share the same loss, and a separate optimizer for each network. How can I backprop the loss to both networks?
Any help would be appreciated.

        optimizer_A.zero_grad()
        optimizer_B.zero_grad()
        loss.backward()
        optimizer_A.step()
        optimizer_B.step()

Hi,
This probably means the common loss is a function of the parameters of both models.

You could then create a single optimizer containing the parameters of both models, like so:

optimizer = torch.optim.Adam(list(modelA.parameters()) + list(modelB.parameters()))

Let me know if this gives any errors.

Hi @srishti-git1110, thanks for the tip, but my two optimizers are distinct and use different hyperparameters.

In that case, is the code you posted not working?

There is also the option to create parameter groups within the same optimizer, but I think that is equivalent to creating two different optimizers with different learning rates etc., even for optimization algorithms like Adam that keep running statistics (see the sketch below).
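A minimal sketch of that option, assuming two hypothetical models modelA and modelB and illustrative learning rates:

        # one optimizer, two parameter groups with their own hyperparameters
        optimizer = torch.optim.Adam([
            {"params": modelA.parameters(), "lr": 1e-3},  # group for the first network
            {"params": modelB.parameters(), "lr": 1e-4},  # group for the second network
        ])

Since Adam keeps its running statistics per parameter, this should behave the same as two separate Adam optimizers with those learning rates.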

Is that true? I mean, what is the correct code for this problem?

Your code should work fine: calling loss.backward() once populates the gradients of the parameters of both networks, and each optimizer then updates only the parameters it was given. Please post if you face any errors.
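For reference, a minimal end-to-end sketch of the pattern, with made-up architectures, data, and hyperparameters just for illustration:

        import torch
        import torch.nn as nn

        # two distinct networks (illustrative architectures)
        modelA = nn.Linear(10, 5)
        modelB = nn.Linear(5, 1)

        # two distinct optimizers with different hyperparameters (illustrative values)
        optimizer_A = torch.optim.Adam(modelA.parameters(), lr=1e-3)
        optimizer_B = torch.optim.SGD(modelB.parameters(), lr=1e-2, momentum=0.9)

        x = torch.randn(32, 10)
        target = torch.randn(32, 1)

        # one loss that depends on the parameters of both networks
        loss = nn.functional.mse_loss(modelB(modelA(x)), target)

        optimizer_A.zero_grad()
        optimizer_B.zero_grad()
        loss.backward()     # fills .grad for the parameters of both networks
        optimizer_A.step()  # each optimizer updates only the parameters it was given
        optimizer_B.step()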
