Problems Minimising Multi-task Loss

CCL · February 6, 2020, 5:52am

Here, I’ve declared a custom multi-task loss in PyTorch, using BCEWithLogitsLoss for the (binary) mask segmentation loss and BCELoss for the classification loss (I use fully connected layers followed by a sigmoid). I weight BCEWithLogitsLoss at 0.9 and BCELoss at 0.1, sum them, and backpropagate the summed loss. I’m using the Adam optimiser for minimisation. Please note that the y-axis label of the plot is wrong (it should read multi-task loss, not just BCEWithLogits).
[plot: loss curve]
However, as the graph shows, the loss does not decrease well. What might be causing this, and how can I fix it? Thanks!
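For readers following along, the setup described above might look roughly like the sketch below. The model, layer sizes, learning rate, and tensor shapes are all hypothetical stand-ins (the original training code was not posted); only the loss functions, the 0.9/0.1 weights, and the Adam optimiser come from the post. The sketch assumes the segmentation head returns raw logits and the classification head ends in a sigmoid.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)


class TinyMultiTaskNet(nn.Module):
    """Hypothetical two-headed network, used only for illustration."""

    def __init__(self):
        super().__init__()
        # Segmentation head: returns per-pixel raw logits (no sigmoid).
        self.seg_head = nn.Conv2d(1, 1, kernel_size=3, padding=1)
        # Classification head: fully connected layer followed by a sigmoid.
        self.cls_head = nn.Sequential(
            nn.Flatten(), nn.Linear(8 * 8, 1), nn.Sigmoid()
        )

    def forward(self, x):
        return self.seg_head(x), self.cls_head(x)


model = TinyMultiTaskNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # lr is an assumption

seg_criterion = nn.BCEWithLogitsLoss()  # expects raw logits
cls_criterion = nn.BCELoss()            # expects probabilities in [0, 1]

# Random dummy batch standing in for real images and labels.
images = torch.randn(4, 1, 8, 8)
mask_targets = torch.randint(0, 2, (4, 1, 8, 8)).float()
class_targets = torch.randint(0, 2, (4, 1)).float()

seg_logits, cls_probs = model(images)

# Weighted sum of the two task losses, as described in the post.
loss = (0.9 * seg_criterion(seg_logits, mask_targets)
        + 0.1 * cls_criterion(cls_probs, class_targets))

optimizer.zero_grad()
loss.backward()
optimizer.step()
```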

ptrblck · February 6, 2020, 6:38am

Are you applying a sigmoid to both outputs?
Note that nn.BCEWithLogitsLoss expects raw logits, not probabilities, since it applies the sigmoid internally.
Could you post the training code, so that we can have a look?
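A small sketch of why this matters, using random dummy logits and targets (not data from this thread): nn.BCEWithLogitsLoss on raw logits matches nn.BCELoss on sigmoid outputs, whereas feeding sigmoid outputs into nn.BCEWithLogitsLoss applies the sigmoid twice. In that case the effective probabilities are squeezed into roughly (0.5, 0.73), so the per-element loss can never fall below about 0.313, which could look like a loss that refuses to go down.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Dummy raw model outputs and binary targets, for illustration only.
logits = torch.randn(16)
targets = torch.randint(0, 2, (16,)).float()

with_logits = nn.BCEWithLogitsLoss()
plain_bce = nn.BCELoss()

# Correct: raw logits go straight into BCEWithLogitsLoss.
correct = with_logits(logits, targets)

# Equivalent: apply the sigmoid yourself and use BCELoss.
equivalent = plain_bce(torch.sigmoid(logits), targets)

# Mistake: sigmoid outputs passed to BCEWithLogitsLoss (sigmoid applied twice).
double_sigmoid = with_logits(torch.sigmoid(logits), targets)

print(f"BCEWithLogitsLoss(logits):          {correct.item():.4f}")
print(f"BCELoss(sigmoid(logits)):           {equivalent.item():.4f}")
print(f"BCEWithLogitsLoss(sigmoid(logits)): {double_sigmoid.item():.4f}")
```

If the segmentation branch ends in a sigmoid, either remove that sigmoid and keep nn.BCEWithLogitsLoss, or keep it and switch that branch to nn.BCELoss; the former is generally the more numerically stable choice.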

CCL · February 6, 2020, 6:40am

Ah yes, you are completely right! I’ll fix that!