KL Divergence produces negative values

MFajcik1 · January 20, 2020, 10:02am

Did you normalized values with log_softmax?

torch.nn.KLDivLoss(size_average=False)(F.log_softmax(scores, -1), targets)