you can use nn.LogSoftmax
, it is numerically more stable and is less likely to nan than using Softmax
1 Like
you can use nn.LogSoftmax
, it is numerically more stable and is less likely to nan than using Softmax