As an aside, you probably don’t want to use MSELoss for a
multi-class problem.
out.sum(0) is summing over the batch dimension of your input / target (even if your batch size is 1).
Given what you say, I speculate that input (the output of your
model) is a vector of 61 values, one for each class (for a single
sample in your batch), and that target is also a vector of 61
values (perhaps your class labels one-hot encoded).
If you want to do this (and you probably don’t), you should be
using out.sum() to sum over all elements of the out tensor,
that is, over both the batch and class dimensions.
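To make the difference concrete, here is a minimal sketch (the shape
[nBatch, nClass] = [4, 61] is my assumption based on your description):

```python
import torch

out = torch.randn(4, 61) ** 2   # e.g., per-element squared errors, shape [nBatch, nClass]
print(out.sum(0).shape)         # torch.Size([61]) -- one value per class, not a scalar loss
print(out.sum().shape)          # torch.Size([])   -- a single scalar, suitable for backward()
```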
What I really mean is that you should reorganize your problem a
little bit and use BCEWithLogitsLoss.
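Something along these lines (just a sketch; the batch size of 4 and the
random stand-ins for your model output and labels are my assumptions):

```python
import torch

logits = torch.randn(4, 61, requires_grad=True)   # stand-in for your model's raw output (no sigmoid)
labels = torch.randint(0, 61, (4,))               # stand-in integer class labels
target = torch.nn.functional.one_hot(labels, num_classes=61).float()
loss = torch.nn.BCEWithLogitsLoss()(logits, target)   # scalar (mean over all elements by default)
loss.backward()
```

Note that BCEWithLogitsLoss takes raw logits and applies the sigmoid
internally, so don’t put a sigmoid at the end of your model.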
However, if you have a good reason to be using MSELoss (and I’m
right that your input and target have shape [nBatch, nClass]),
then, yes, you should return out.sum() (rather than out.sum(0))
so that you will be summing over classes as well as the samples in
your batch.
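In that case your loss function would look something like this (a sketch
of what I’m guessing your code does; the function name is hypothetical):

```python
import torch

def mse_sum_loss(input, target):
    # input, target: shape [nBatch, nClass]
    out = (input - target) ** 2
    return out.sum()   # sum over both batch and class dimensions (not out.sum(0))
```

This is equivalent to torch.nn.MSELoss(reduction='sum')(input, target),
so you don’t actually need to write it yourself.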