[SOLVED] Class Weight for BCELoss

Hey there,

I’m trying to increase the weight of an undersampled class in a binary classification problem.

torch.nn.BCELoss has a weight argument, but I don’t quite get it: it is a constructor parameter and is not updated depending on the batch being processed, so it doesn’t achieve what I need.

What is the correct way of simulating a class weight, similar to the way Keras does?

Cheers

5 Likes

Solved with a custom loss function:

import torch

def weighted_binary_cross_entropy(output, target, weights=None):
    # output: predicted probabilities in (0, 1); target: binary labels (0 or 1).
    # weights[1] scales the positive-class term, weights[0] the negative-class term.
    if weights is not None:
        assert len(weights) == 2

        loss = weights[1] * (target * torch.log(output)) + \
               weights[0] * ((1 - target) * torch.log(1 - output))
    else:
        loss = target * torch.log(output) + (1 - target) * torch.log(1 - output)

    return torch.neg(torch.mean(loss))
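
For anyone wondering how to call it, here is a minimal, self-contained sketch (the batch size of 8 and the [0.3, 0.7] weights are made up; in practice they should reflect your class frequencies):

import torch

output = torch.sigmoid(torch.randn(8, requires_grad=True))   # predicted probabilities in (0, 1)
target = torch.randint(0, 2, (8,)).float()                    # binary labels
loss = weighted_binary_cross_entropy(output, target, weights=[0.3, 0.7])
loss.backward()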
12 Likes

According to the doc here
http://pytorch.org/docs/nn.html#bceloss
the weight parameter is a tensor with one weight per example in the batch, so it must have the same size as the batch. You can set the weight at the beginning of each batch, for example:

criterion = nn.BCELoss()
for batch in data:
    input, label, weight = batch
    criterion.weight = weight
    loss = criterion(pred, label)
    ...
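
If you want per-class rather than per-example weights with this approach, you can build the weight tensor from the batch labels. A minimal sketch (the [1.0, 10.0] class weights and the batch size of 8 are assumptions for illustration):

import torch
import torch.nn as nn

class_weights = torch.tensor([1.0, 10.0])        # index 0 = negative class, index 1 = positive class
label = torch.randint(0, 2, (8,)).float()        # dummy binary labels for one batch
pred = torch.rand(8)                             # dummy predicted probabilities

criterion = nn.BCELoss()
criterion.weight = class_weights[label.long()]   # one weight per example, looked up from its class
loss = criterion(pred, label)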

10 Likes

That’s a neat solution as well… I ended up changing my loss function again to NLLLoss, which supports class weights, and it’s probably the easiest native solution. My custom loss was giving me NaNs towards the end of training and I have no idea why!

3 Likes

Did you apply LogSoftmax before computing the loss? NLLLoss takes log-probabilities as input, not probabilities.
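
In code, a 2-class formulation of the binary problem would look roughly like this (the shapes and the [1.0, 10.0] class weights below are made-up values, not something from this thread):

import torch
import torch.nn as nn

logits = torch.randn(8, 2, requires_grad=True)             # dummy scores for the two classes
target = torch.randint(0, 2, (8,))                         # integer class labels
log_probs = nn.LogSoftmax(dim=1)(logits)                   # NLLLoss expects log-probabilities
criterion = nn.NLLLoss(weight=torch.tensor([1.0, 10.0]))   # up-weight the rare class
loss = criterion(log_probs, target)
loss.backward()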

2 Likes

Yes, I did change the softmax to a log-softmax. The custom loss, however, works with a regular softmax, so I guess the NaNs could be related to the lack of an epsilon term to prevent the network from outputting a “hard one or zero”.
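
That hypothesis checks out: if the output saturates at exactly 0 or 1, torch.log returns -inf, and multiplying that by a zero target gives NaN, which then poisons the mean. A tiny demonstration:

import torch

p = torch.tensor(0.0)        # a saturated "hard zero" output
t = torch.tensor(0.0)        # the target for that example
print(torch.log(p))          # tensor(-inf)
print(t * torch.log(p))      # tensor(nan): 0 * -inf is NaN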

1 Like

Thank you. This works for me.

Referring to the C code, there is a safe_log function which returns log(1e-12) if the input is 0.

That may be the source of the numerical instability. Applying a clamp may help:

output = torch.clamp(output, min=1e-8, max=1 - 1e-8)
loss = pos_weight * (target * torch.log(output)) + neg_weight * ((1 - target) * torch.log(1 - output))
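
Folding that clamp into the custom function above might look like this (just a sketch; the 1e-8 epsilon is one common choice, not something from the original post):

import torch

def weighted_binary_cross_entropy(output, target, weights=None, eps=1e-8):
    # Clamp to avoid log(0) = -inf, which turns the mean into NaN.
    output = torch.clamp(output, min=eps, max=1 - eps)
    if weights is not None:
        assert len(weights) == 2
        loss = weights[1] * (target * torch.log(output)) + \
               weights[0] * ((1 - target) * torch.log(1 - output))
    else:
        loss = target * torch.log(output) + (1 - target) * torch.log(1 - output)
    return torch.neg(torch.mean(loss))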
3 Likes

Yes, that’s the easiest way… but you can simply use NLLLoss; it supports class weights now.

1 Like

There should be a “-” in the loss function.

1 Like

Sorry, I’m wrong; I missed the torch.neg. :sweat:

Hello, thanks for your custom function. I used it, but I got this error:
The size of tensor a (32) must match the size of tensor b (2) at non-singleton dimension 1
Can you help with this problem?

Just a quick question: when applying BCELoss with weights, do we need to normalize the weights by the batch size, or would raw weights be fine?

Hi Miguel,

I’m wondering how you used NLLLoss for a binary classification problem?

Thanks. I just don’t know how to pass the weight to NLLLoss.

I think output = torch.clamp(output, 1e-9, 1 - 1e-9) prevents the output from containing exact zeros (or ones).

1 Like

Hey!

Can you explain to me how this works?
I am working on a classification problem on the CelebA dataset, in which there’s a huge imbalance for some features. How can I fix this issue with the code mentioned above?

Just replace BCELoss with CrossEntropyLoss, which takes a weight per class; it is probably the easiest solution.

criterion = nn.CrossEntropyLoss(weight=torch.tensor([1.0, 20.4]).to(device))

Is this okay?
Class 1 samples = 193140
Class 2 samples = 9459
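
For what it’s worth, 20.4 roughly matches the inverse-frequency ratio of those counts; a quick sketch of that computation (treating class 1 as the reference class):

import torch

n_class1, n_class2 = 193140, 9459
ratio = n_class1 / n_class2                          # ~20.42
weight = torch.tensor([1.0, ratio])                  # up-weight the rarer class 2
criterion = torch.nn.CrossEntropyLoss(weight=weight)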