You need a “soft” function to get a meaningful gradient. Use something like torch.nn.functional.grid_sample which interpolates between values in your tensor.
import torch
import torch.nn.functional as F
src = torch.arange(25, dtype=torch.float).reshape(1, 1, 5, 5).requires_grad_() # 1 x 1 x 5 x 5 with values 0 ... 24
indices = torch.tensor([[-1, -1], [0, 0]], dtype=torch.float).reshape(1, 1, -1, 2) # 1 x 1 x 2 x 2
output = F.grid_sample(src, indices, align_corners=True) # newer PyTorch defaults align_corners to False and warns if it is unset
print(output) # tensor([[[[ 0., 12.]]]])
(-1, -1) is the top-left corner and (0, 0) is the center. src has to be 4-D or 5-D (N x C x IH x IW), and the same goes for indices. If you don’t have a batch size (N) or channels (C), set those dimensions to size 1.
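To illustrate what “soft” buys you, here is a small sketch (not from the original answer) showing that the gradient flows back to the sampling coordinates themselves, which is exactly what plain integer indexing cannot do. It assumes `align_corners=True`, i.e. the convention above where (-1, -1) lands exactly on the top-left pixel:

```python
import torch
import torch.nn.functional as F

# 1 x 1 x 5 x 5 feature map with values 0 ... 24
src = torch.arange(25, dtype=torch.float).reshape(1, 1, 5, 5)

# One sampling location in [-1, 1] coordinates; requires_grad so we can
# differentiate the sampled value with respect to the coordinates.
indices = torch.tensor([[0.1, 0.2]], dtype=torch.float).reshape(1, 1, 1, 2).requires_grad_()

out = F.grid_sample(src, indices, align_corners=True)
out.sum().backward()

# indices.grad is non-zero: moving the sample point changes the output,
# so the coordinates can be optimized by gradient descent.
print(out, indices.grad)
```

Because bilinear interpolation is piecewise linear in the coordinates, the gradient with respect to each coordinate is just the local difference between neighboring pixels, scaled by the [-1, 1] → pixel mapping.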
Hi @colesbury, will the indexing be differentiable if I need to apply different loss operations depending on the magnitude of the values in the tensor?
Hi! I am wondering what a “meaningful gradient” is here. Is it possible for the gradients to be meaningless even though the whole forward pass is differentiable (i.e., grad_fn exists)?
Another question: I only need to crop a region from the feature maps. Instead of using torch.nn.functional.grid_sample, can I simply use feature_maps[y1:y2, x1:x2]? My goal is to make the box coordinates trainable, so they can find the important region of the feature maps. Thanks!
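One way this kind of crop is commonly handled (a sketch under my own assumptions, not part of the original thread): slicing like feature_maps[y1:y2, x1:x2] needs hard integer indices, so no gradient reaches the box coordinates. Expressing the crop as an affine transform sampled with F.affine_grid + F.grid_sample keeps the box parameters trainable. The soft_crop helper and its (cx, cy, w, h) parametrization below are hypothetical names for illustration:

```python
import torch
import torch.nn.functional as F

def soft_crop(features, cx, cy, w, h, out_size=(3, 3)):
    """Differentiably crop a box (center cx, cy; extents w, h; all in [-1, 1]
    normalized coordinates) out of a 1 x C x H x W feature map.
    Hypothetical helper, not from the original thread."""
    # Affine matrix mapping the output grid's [-1, 1] range onto the box region.
    theta = torch.stack([
        torch.stack([w / 2, torch.zeros_like(w), cx]),
        torch.stack([torch.zeros_like(w), h / 2, cy]),
    ]).unsqueeze(0)  # 1 x 2 x 3
    grid = F.affine_grid(theta, (1, features.size(1), *out_size), align_corners=True)
    return F.grid_sample(features, grid, align_corners=True)

features = torch.arange(25, dtype=torch.float).reshape(1, 1, 5, 5)
cx = torch.tensor(0.0, requires_grad=True)  # box parameters stay trainable
cy = torch.tensor(0.0, requires_grad=True)
w = torch.tensor(1.0, requires_grad=True)
h = torch.tensor(1.0, requires_grad=True)

crop = soft_crop(features, cx, cy, w, h)
crop.sum().backward()
print(cx.grad, cy.grad)  # non-zero: gradients flow back to the box coordinates
```

The crop here is a fixed-size resampled view of the box rather than a variable-size slice, which is usually what you want anyway when the result feeds into further layers.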