Multi-class cross entropy loss and softmax in pytorch

ptrblck · November 23, 2020, 12:17am

nn.CrossEntropyLoss expects raw logits in the shape [batch_size, nb_classes, *] so you should not apply a softmax activation on the model output. The class dimension should be in dim1 in the model output.