BCELoss vs BCEWithLogitsLoss

Yes, you are completely right and I’ve mixed up both names. I’ll edit my post to get it right.

In the example the nn.Sigmoid will be applied by m(input), so it should be right.
Your code should apply the sigmoid function twice (once in m() and the second time using torch.sigmoid).

4 Likes