Loss function for One-hot encoding

Krish · October 4, 2018, 1:39am

I am trying to implement MNIST with one hot encoding but cross entropy won’t work. What loss function should I be using?

InnovArul · October 4, 2018, 4:00am

what if you take argmax of the one hot vector and pass it to cross entropy loss?

Krish · October 4, 2018, 4:13am

But wouldn’t the argmax of every one-hot-encoded vector be 1?

InnovArul · October 4, 2018, 4:21am

No. argmax gives the index of max element

Krish · October 4, 2018, 11:59am

Taking the argmax worked. Thanks
But I was looking at the cifar-10 tutorial of Pytorch and it had an output layer of width 10 but the target was a scalar only. How does that work?

InnovArul · October 4, 2018, 12:26pm

the cross entropy loss function internally takes care of this.

Krish · October 4, 2018, 1:59pm

Can you please explain how that happens or point to any resource?