Activation values in ResNet

ArshadIram · August 5, 2022, 9:12am

Hi Ptrblck,

I have used this solution. My batch size is 4, and I extracted the intermediate layer of my model.

I am getting the resultant output with shape(4,84). However, I am wondering 4 means that I only extract 1 batch activation value. How can I extract all batch activation values? Can you please help me out in this regard?

ptrblck · August 5, 2022, 9:14am

If you want to get the activations for all samples, you could append them to e.g. an activation list instead of replacing them in the dict.

ArshadIram · August 5, 2022, 9:18am

activation = {}
def get_activation(name):
def hook(model, input, output):
activation[name] = output.detach().numpy()
return hook

save_net.fc2.register_forward_hook(get_activation(‘fc2’))
with torch.no_grad():
for batch_idx, (x, y) in enumerate(trainloader):
save_net(x)

I replaced it with this

activations = []
def get_activation(name):
def hook(model, input, output):
activation= output.detach().numpy()
activations.append(activation)

return hook

Is it correct?

ptrblck · August 5, 2022, 4:34pm

Yes, this looks right (although the code is not properly formatted, but I assume it’s correct in your script). Did you try it out and was it working?

ArshadIram · August 6, 2022, 2:40am

Yep, I tried out.

I was trying to fetch out layer three activation values:
Linear(in_features=84, out_features=10, bias=True)

I am now getting this shape of data (images, 4, 10). Is it correct to reshape the array like this:

(images, 4*10) ?

Linear(in_features=84, out_features=10, bias=True)

ptrblck · August 6, 2022, 4:35am

It seems you are passing the inputs to the linear layer as [batch_size, 4, 84] which could be interpreted as temporal data where the linear layer is applied to each time step (dim1).
Does this fit your use case as usually you would flatten the activation before passing it to the linear layer via:

x = x.view(x.size(0), -1)
out = linear(x)

ArshadIram · August 7, 2022, 10:48am

I have already flattened the input before passing it to the linear layer.

flatten_values = torch.flatten(con_2_output, 1)
after flattening, these are the three linear layers, and I want to extract the last layer’s feature activation values. Do I need to flatten again before fetching the values from 2 or third linear layer?

(fc1): Linear(in_features=400, out_features=120, bias=True)
(fc2): Linear(in_features=120, out_features=84, bias=True)
(fc3): Linear(in_features=84, out_features=10, bias=True)

ptrblck · August 7, 2022, 10:06pm

Something seems to be off then, since the flattened activation should have two dimensions instead of [images, 4, 10]. Add a print statement to the forward method of your model and check why the activation is 3-dimensional.

No, you do not need to flatten the activations again between the linear layers, as the outputs should already have 2 dimensions.

ArshadIram · August 8, 2022, 1:52pm

After extracting the activations, I am converting them into a NumPy array. So before converting into the Numpy, the shape is, let’s say (80,10), and after converting into NumPy, it is (1, 80, 10). Why is it so?

ptrblck · August 8, 2022, 4:19pm

That should also not happen as seen in e.g. these code snippets:

x = torch.randn(1, 2, 3)
arr = x.numpy()
print(arr.shape)
# (1, 2, 3)

x = torch.randn(80, 10)
arr = x.numpy()
print(arr.shape)
# (80, 10)

x = torch.randn(1, 80, 10)
arr = x.numpy()
print(arr.shape)
# (1, 80, 10)

PyTorch will not change the shape of this tensor behind your back.

ArshadIram · August 11, 2022, 12:14am

I am getting a ndarray list of shapes [100, 64, 1, 1]

However, when I convert this list to np.array I am getting this shape
result = np.asarray(activations)

I don’t know why converting a list to np.array changes the shape.

ptrblck · August 11, 2022, 3:41am

np.asarray doesn’t change the shape but stacks the list of 100 numpy arrays of the shape [100, 64, 1, 1] into a single array.
Here is an example with other shapes which might be easier to understand:

list_of_arrays = [np.random.randn(5, 3), np.random.randn(5, 3)]
stacked_array = np.asarray(list_of_arrays)
print(stacked_array.shape)
# (2, 5, 3)

ArshadIram · August 11, 2022, 10:34am

Many thanks for this clarification. How can I ignore it, or I can just drop it for further processing?

ptrblck · August 11, 2022, 4:14pm

You should not ignore these values and try to figure out why you are using a list of arrays while it seems you would expect to use a single array instead.