Multi binary-class classification

lima · October 15, 2021, 9:33am

Hello peeps,

I have some audio data for which i computed the audio features. The labels for each input feature is 20 binary classes. i.e. for each input element there are 20 classes (1 or 0).
My initial understanding is that this is a multi-label classification that can be addressed using nn.BCELoss.

Can you guys let me know if I got this right and to explain how nn.BCELoss works?

Thank you

nateanl · October 15, 2021, 11:02am

Hi @lima, nn.BCELoss is designed for binary classification task. The prediction and label are both of shape (batch, ...).
In your case, you have 20 classes, which is a multi-class classification task. You can use nn.CrossEntropyLoss where the prediction is a float tensor of shape (batch, class, ...) and the label is a long tensor of shape (batch, ...).

lima · October 15, 2021, 11:21am

Hi @nateanl thank you for your answer. but what about the fact that each class should either be 1 or 0?

nateanl · October 15, 2021, 11:42am

I see, if the input has more than one classes that are labeled as 1, then you should use nn.BCELoss and make your prediction and label both of shape (batch, class).

lima · October 15, 2021, 12:14pm

Hi @nateanl The idea is the following. I have computed the mfcc features for some audio data, and I have trained my model to do phoneme classification.
Now instead of the phoneme classification, I would like to do a phonetic feature classification (i.e. classify the individual phonetic features (total 20) for each audio frame). I believe nn.BCELoss can do the job, but what about the prediction and labels shape?

xdwang0726 · October 15, 2021, 12:21pm

If you are dealing with multi-label classification sigmoid + nn.BCELoss should be what you are looking for; and if you are looking for multi-class classification (each of your sample will belong to only one of the 20 classes) then softmax + nn.CrossEntropyLoss are the things.

lima · October 15, 2021, 1:05pm

Thank you @xdwang0726