However, for the softmax function, for example, there's only torch.nn.Softmax and torch.nn.functional.softmax, but no torch.softmax. I'm confused and would like to know the thinking behind this design. Are there other functions designed like this?
So the idea is to put the more deep-learning-oriented functions in torch.nn.functional and keep general-purpose functions directly under torch. softmax was deemed to fall into the former category, sigmoid into the latter.
While torch.softmax does exist, it is there by accident rather than by design, which is why it is not documented (previous versions of PyTorch didn't have the fancy torch._C._nn module to hold the C++ implementations of the torch.nn.functional functions). I would advise using only the documented variants to stay out of trouble should someone start cleaning things up.
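For reference, a minimal sketch of the documented call sites mentioned above (the input shape and dim are arbitrary, just for illustration):

```python
import torch
import torch.nn.functional as F

x = torch.randn(2, 3)

# Deep-learning-oriented, documented variants live in torch.nn / torch.nn.functional:
probs_module = torch.nn.Softmax(dim=1)(x)  # module form
probs_func = F.softmax(x, dim=1)           # functional form

# General-purpose, documented tensor function lives directly under torch:
s = torch.sigmoid(x)

# torch.softmax(x, dim=1) also happens to work today, but it is undocumented,
# so prefer the variants above.
```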