I use a BERT model for multi-class text classification (6 classes), with batch_size=256.

Predicted output for a single post = [0.6, 0.4, 0.8, 0.2, 0.3, 0.1]; shape for a batch = (256, 6).

True output = 2 (a class index) for a single post; shape for a batch = (256,).

I want to use dice loss, so I found this code:

```
import numpy as np
import torch

smooth = 10

def dice_loss(y_pred, y_true):
    # soft Dice: overlap between predicted scores and targets
    product = np.multiply(y_pred, y_true)
    intersection = np.sum(product)
    coefficient = (2. * intersection + smooth) / (np.sum(y_pred) + np.sum(y_true) + smooth)
    loss = 1. - coefficient
    # or "-coefficient"
    return torch.tensor(loss, requires_grad=True)
```
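To sanity-check what this computes on my shapes, here is a pure-NumPy sketch of a single post (my assumption: the integer label first has to be one-hot encoded to line up with the 6 prediction scores; `smooth = 10` as above):

```python
import numpy as np

smooth = 10

def dice_loss(y_pred, y_true):
    # soft Dice: overlap between predicted scores and the target vector
    intersection = np.sum(np.multiply(y_pred, y_true))
    coefficient = (2. * intersection + smooth) / (np.sum(y_pred) + np.sum(y_true) + smooth)
    return 1. - coefficient

# single post: 6 class scores; true class index 2 as a one-hot row
y_pred = np.array([0.6, 0.4, 0.8, 0.2, 0.3, 0.1])
y_true = np.eye(6)[2]   # [0, 0, 1, 0, 0, 0]

loss = dice_loss(y_pred, y_true)
# intersection = 0.8, so loss = 1 - (2*0.8 + 10) / (2.4 + 1 + 10)
```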

but it seems to be written for binary classification, not multi-class.

So how do I modify it to work for multi-class classification?