# Torch.cat and backpropagation

Does torch.cat work with backpropagation?

Hi,

Yes,
All operations that work with floating-point numbers support backprop in PyTorch (or you will get an error when trying).

Thank you!
So if I have something like this:

```python
class NeuralNet2(nn.Module):
    def __init__(self):
        super(NeuralNet2, self).__init__()

        self.input = nn.Linear(2, 40)
        self.hiddenLeft = nn.Linear(40, 2)
        self.hiddenRight = nn.Linear(40, 2)
        self.out = nn.Linear(4, 6)

    def forward(self, x):
        x = self.input(x)
        xLeft, xRight = torch.sigmoid(self.hiddenLeft(x)), torch.sigmoid(self.hiddenRight(x))
        x = torch.cat((xLeft, xRight), 2)
        x = torch.sigmoid(self.out(x))

        return x
```

Is it necessary to re-implement the backward? Or how would it work in this case?

No you don’t need to reimplement the backward. Since we can just backprop through the cat operation, gradients will be computed for all your parameters.
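A minimal sketch of this point (shapes are illustrative, not from the thread): both inputs to `torch.cat` receive gradients when you backprop through the result.

```python
import torch

# Two leaf tensors that require gradients
a = torch.randn(3, 2, requires_grad=True)
b = torch.randn(3, 2, requires_grad=True)

# Concatenate along dim 1, then backprop through the result
out = torch.cat((a, b), dim=1)  # shape (3, 4)
out.sum().backward()

# Both inputs get a gradient; cat simply routes each slice back
print(a.grad.shape)  # torch.Size([3, 2])
print(b.grad.shape)  # torch.Size([3, 2])
```

Since the gradient of a sum is 1 for every element, each `grad` here is a tensor of ones of the input's shape.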


So does the concatenation produce something similar to a sequential model?

Not sequential because your modules don’t operate on the result of the previous one.
It just concatenates two Tensors. That is it.

That is to say, independently of the concatenation, the neurons will still keep track of which ones precede them when the weights are updated?

> the neurons will be able to keep a history of who precedes them at the time of updating the weights

Not sure what that means.
But if the question is “Will it be able to compute gradients?”, then the answer is yes.

If you would like to access specific positions in a tensor, how would you do it most efficiently?
```python
tensor([[[0.1743, 0.2439, 0.2543]],
        [[0.1325, 0.0778, 0.2292]],
        [[0.3474, 0.1554, 0.1000]],
        [[0.1599, 0.0283, 0.0305]],
        [[0.3842, 0.3997, 0.2972]],
        [[0.1876, 0.0815, 0.3047]],
        [[0.2280, 0.1096, 0.2919]],
        [[0.3843, 0.3552, 0.1672]]])
```

And I want the tensor with only the first and third columns?

Hi,

The simplest would be to use indexing: `t[:, (0, 2)]`.

Thanks!
Wouldn't it be `t[:, :, (0, 2)]`?

Ah yes, sorry, I thought it was a 2D Tensor.
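For a 3D tensor shaped like the one above (8, 1, 3), the indexing looks like this (the random values here are just placeholders):

```python
import torch

# A (8, 1, 3) tensor like the one printed above
t = torch.rand(8, 1, 3)

# Select the first and third entries along the last dimension
sub = t[:, :, (0, 2)]
print(sub.shape)  # torch.Size([8, 1, 2])
```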

And if I wanted to concatenate a list of tensors, how would I do it with cat?

cat takes any sequence of Tensors as input. So you can just give the list to cat.
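A quick sketch of that (sizes are made up for illustration):

```python
import torch

# A Python list of tensors with matching shapes along the non-cat dims
tensors = [torch.rand(1, 1, 3) for _ in range(8)]

# Pass the list directly to cat; dim=0 stacks them along the first axis
stacked = torch.cat(tensors, dim=0)
print(stacked.shape)  # torch.Size([8, 1, 3])
```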

Is it proper to call torch.optim.Adam(net.parameters()) after net(inputs.float())? I need this because, for unavoidable reasons, I define the network in the forward.

```python
for i, [inputs, labels] in enumerate(trainloader):
    inputs = inputs.to(device)
    labels = labels.to(device)

    # Forward + backward + optimize
    optimizer.zero_grad()  # clear gradients accumulated from the previous step
    outputs = net(inputs.float())

    loss = criterion(outputs, labels.float())
    loss.backward()
    optimizer.step()
```

I would advise against it, because optimizers have state that you want to keep from one iteration to the next.
You can provide a dummy input to the network when you create it if you really need to know the input size to create all the Parameters.
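One possible sketch of that advice (the `LazyNet` class and all shapes here are hypothetical, not from the thread): create the layers during the first forward pass, run one dummy pass, and only then build the optimizer.

```python
import torch
import torch.nn as nn

class LazyNet(nn.Module):
    """Hypothetical example: a layer whose size is only known at forward time."""

    def __init__(self):
        super().__init__()
        self.fc = None  # created lazily on the first forward pass

    def forward(self, x):
        if self.fc is None:
            # Size the layer from the actual input the first time we see it.
            # Assigning an nn.Module attribute registers its parameters.
            self.fc = nn.Linear(x.shape[-1], 4)
        return self.fc(x)

net = LazyNet()
_ = net(torch.zeros(1, 2))  # dummy pass creates the parameters
optimizer = torch.optim.Adam(net.parameters())  # no longer an empty list
```

After the dummy pass, `net.parameters()` yields the weight and bias of `fc`, so the optimizer sees real Parameters and keeps its state across iterations.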

I get the following error if I put that line before the forward:

```
ValueError: optimizer got an empty parameter list
```

It happens because I define the net in the forward, so the parameters of the first layers are only created at that point. What do you recommend?

Is it possible to use dummy values to create the layers? But in that case, net.parameters() is not correct at that time; how can it be updated later with the real parameters?

I think you’re not using Parameters properly.
They are supposed to contain a set of weights that you update iteratively so that they converge to the right values. So you should not re-create them all the time, since the optimizer updates them in place.

So it is necessary to define the net, that is the nn.Linear(n1, n2) layers, before the forward? In my case I only know the values of n1 and n2 during the forward; what can I do?
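One option, not mentioned in the thread and an assumption on my part about available PyTorch versions: recent releases (1.8+) provide `nn.LazyLinear`, which defers choosing `in_features` until the first forward pass, so only the output size must be known up front.

```python
import torch
import torch.nn as nn

# nn.LazyLinear infers in_features from the first input it sees;
# the hidden sizes here are illustrative only.
net = nn.Sequential(nn.LazyLinear(40), nn.Sigmoid(), nn.Linear(40, 6))

_ = net(torch.zeros(1, 2))  # first pass materializes the (40, 2) weight
optimizer = torch.optim.Adam(net.parameters())
print(net[0].weight.shape)  # torch.Size([40, 2])
```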