The formula in the docs is the negative log softmax written as:
- log ( exp(x[class]) / sum(exp(x[j]))
x
are the logits here while exp()/sum(exp())
is the softmax function.
The formula in the docs is the negative log softmax written as:
- log ( exp(x[class]) / sum(exp(x[j]))
x
are the logits here while exp()/sum(exp())
is the softmax function.