AFAIK, torch.nn.Sigmoid calls torch.nn.functional.sigmoid in the background, and according to this answer, the functional and torch.xxx calls differ in their backward implementation (which is more efficient and GPU capable in the torch.nn case).
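If you want to check this on your own install, here is a minimal sketch (assuming a reasonably recent PyTorch release where torch.nn.functional.sigmoid is still available). It only compares the forward values and the gradients of the three call styles on a small CPU tensor; it says nothing about speed or GPU behaviour, and the helper name value_and_grad is just something I made up for the example.

```python
import torch
import torch.nn.functional as F

# Small fixed input so the three variants see exactly the same values.
x = torch.linspace(-3.0, 3.0, steps=7)


def value_and_grad(fn):
    """Hypothetical helper: run fn on a fresh copy of x and return (output, d(sum)/dx)."""
    inp = x.clone().requires_grad_(True)
    out = fn(inp)
    out.sum().backward()
    return out.detach(), inp.grad


module_out, module_grad = value_and_grad(torch.nn.Sigmoid())
functional_out, functional_grad = value_and_grad(F.sigmoid)
tensor_out, tensor_grad = value_and_grad(torch.sigmoid)

# Compare forward results and gradients across the three call styles.
same_forward = (
    torch.allclose(module_out, functional_out)
    and torch.allclose(module_out, tensor_out)
)
same_backward = (
    torch.allclose(module_grad, functional_grad)
    and torch.allclose(module_grad, tensor_grad)
)
print("same forward:", same_forward)
print("same backward:", same_backward)
```

If both flags come out as True on your version, any remaining difference between the variants would be in how they are dispatched, not in the numbers they produce. To compare performance you would need to time them separately on the device you care about.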