I have a network which is getting loss from three streams. I am adding them as the final loss like this:
loss = nn.BCEWithLogitsLoss(reduction=‘sum’)
lossf=loss(outputs1,labels1.float())/N+loss(outputs2,labels2.float())/N+loss(outputs3,labels3.float())/N
lossf.backward()
Now if I use two of the loss terms then each epoch is taking close to 5 mints. Anytime I add the third one it is taking like 10 minutes. For using two terms I didn’t change anything I just commented out the third term.
Any idea, why this is happening?