Why are there 3 ReLU functions (or maybe even more)?

I noticed that one can use the functional library:

import torch.nn.functional as F
...
x = F.relu(self.conv1(x)) 

or use the module from torch.nn as in:

torch.nn.ReLU

or even the clamp operator:

x.clamp(min=0)

Is there a reason why there are three ways to do the same thing?

F.relu is the functional interface to torch.nn.ReLU. Modules like torch.nn.ReLU are sometimes handy, for example when quickly creating a model using nn.Sequential.
You can't add F.relu to an nn.Sequential, as it expects an object that inherits from nn.Module.
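
As a minimal sketch of the difference (the layer sizes below are made up for illustration):

import torch
import torch.nn as nn
import torch.nn.functional as F

# Module style: nn.ReLU is an nn.Module, so it can live inside nn.Sequential
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.ReLU(),
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
    nn.ReLU(),
)

# Functional style: F.relu is just a function, so it is called inside forward()
class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 16, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(16, 32, kernel_size=3, padding=1)

    def forward(self, x):
        x = F.relu(self.conv1(x))
        return F.relu(self.conv2(x))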

About clamp: it is a tensor/Variable method that is more generic than ReLU, and it works on both Tensor and Variable, while F.relu only works on Variables. Also, ReLU is a common enough operation to deserve its own function, instead of having to write x.clamp(min=0).
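
On current PyTorch (where Tensor and Variable have been merged), a quick check that the two give the same result:

import torch
import torch.nn.functional as F

x = torch.randn(5)
print(torch.equal(x.clamp(min=0), F.relu(x)))   # prints True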


Maybe what I'm confused about is why the functional interface even exists. Do you know?

Yes. The functional interface is very handy when you want to perform more complex operations.
For example, let's say that you want the weights of your convolution to be the output of some other network (as in hypernetworks).
In this case, you can't use the Module interface, as it creates its weights during initialization, but you can easily use the functional interface for that:

weights = net1(input)              # another network predicts the conv kernel
res = F.conv2d(input, weights)     # apply it through the functional interface
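
As a slightly fuller sketch (the shapes, layer sizes, and the toy net1 below are invented for illustration), the predicted weights have to be reshaped to conv2d's expected (out_channels, in_channels, kH, kW) layout before the functional call:

import torch
import torch.nn as nn
import torch.nn.functional as F

in_ch, out_ch, k = 3, 8, 3

# Toy hypernetwork: maps a conditioning vector to a flat conv kernel
net1 = nn.Linear(10, out_ch * in_ch * k * k)

z = torch.randn(10)                            # conditioning input for the hypernetwork
x = torch.randn(1, in_ch, 32, 32)              # image batch to convolve

weights = net1(z).view(out_ch, in_ch, k, k)    # reshape to conv2d's weight layout
res = F.conv2d(x, weights, padding=1)          # output shape: (1, out_ch, 32, 32)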