How can l load my best model as a feature extractor/evaluator?

I think it depends on your use case and probably also your coding style.
E.g. if I were working on a new model architecture, where different features should now be returned, I would override the forward. This makes sure I can initialize the model using its new definition without any further manipulation of the model itself.
On the other hand, if I just want to check some intermediates, e.g. for debugging, I would use hooks, as I can add them directly to the model without any changes to it.
Also, I believe that hooks are not scriptable right now.

Your use case might of course be different.
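For illustration only (a minimal sketch with made-up layer names, not taken from the thread), the "override the forward" approach could look like this:

import torch
import torch.nn as nn

class FeatureModel(nn.Module):  # hypothetical example model
    def __init__(self):
        super().__init__()
        self.backbone = nn.Sequential(nn.Linear(8, 16), nn.ReLU())
        self.head = nn.Linear(16, 2)

    def forward(self, x):
        feat = self.backbone(x)  # intermediate feature to expose
        out = self.head(feat)
        return out, feat         # return both, so no hook is needed

model = FeatureModel()
out, feat = model(torch.randn(1, 8))
print(feat.shape)  # torch.Size([1, 16])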


Thank you for the prompt response!

@ptrblck
I wonder about my case: how could I print the output of Unet_Netzero.pretrained.layer1.3.0.act1?

Unet_Netzero(
  (quant): QuantStub()
  (pretrained): Module(
    (layer1): Sequential(
      (0): Conv2d(3, 32, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
      (1): BatchNorm2d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
      (2): ReLU(inplace=True)
      (3): Sequential(
        (0): DepthwiseSeparableConv(
          (conv_dw): Conv2d(32, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), groups=32, bias=False)
          (bn1): BatchNorm2d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (act1): ReLU()
          (hardtanh1): Hardtanh(min_val=0, max_val=6, inplace=True)
          (se): Identity()
          (conv_pw): Conv2d(32, 16, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (bn2): BatchNorm2d(16, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (act2): ReLU()
          (hardtanh2): Hardtanh(min_val=0, max_val=6, inplace=True)
          (skip_add): FloatFunctional(
            (activation_post_process): Identity()
          )
        )
      )

Thanks.

Forward hooks would work as given in this thread in e.g. this post.
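For context, the helper referenced there follows roughly this forward-hook pattern (a sketch; the exact details may differ in the linked post):

import torch

activation = {}

def get_activation(name):
    # forward hook: stores the module's output under the given key
    def hook(module, input, output):
        activation[name] = output.detach()
    return hook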

@ptrblck Thanks for the prompt reply.
I'm still not sure I understand that correctly.
In my case, the DepthwiseSeparableConv is defined in another nn.Module and doesn't show up in the forward() definition of the main network (i.e., Unet_Netzero).

If I want to access pretrained.layer1.3.0.bn1, is the following correct?
model.pretrained.layer1[3][0].bn1.register_forward_hook(get_activation('bn1'))

Thanks

I’m not sure I understand this question completely. If the module isn’t used in the forward method, there won’t be any forward activations and the hook won’t capture anything.

Yes, looks correct.
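For completeness, a hedged sketch of the full usage (assuming the Unet_Netzero instance is called model, the input shape is made up, and get_activation is the helper from above); get_submodule also accepts the dotted path with numeric children:

# two equivalent ways to reach the nested module from the printout above
bn1 = model.pretrained.layer1[3][0].bn1
bn1 = model.get_submodule('pretrained.layer1.3.0.bn1')

bn1.register_forward_hook(get_activation('bn1'))
out = model(torch.randn(1, 3, 224, 224))  # assumed input shape
print(activation['bn1'].shape)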

Hi ptrblck, can we extract the output of any intermediate convolutional layer for feature map visualisation?

Yes, you can register a forward hook on any internal conv layer to get the activation.
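As a rough sketch (assuming get_activation from above, a model with an internal conv layer at model.conv1, and an input tensor x; matplotlib is only used for plotting):

import matplotlib.pyplot as plt

model.conv1.register_forward_hook(get_activation('conv1'))  # any internal conv layer
out = model(x)

fmaps = activation['conv1'][0]               # feature maps of the first sample: [C, H, W]
fig, axes = plt.subplots(1, 4, figsize=(12, 3))
for i, ax in enumerate(axes):
    ax.imshow(fmaps[i].cpu(), cmap='gray')   # show the first few channels
plt.show()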

I see that you have printed the values via print(activation['fc2']), and I can see that F.relu() is applied afterwards.
How can I get the values of F.relu(self.fc2(x))?

If you want to use forward hooks as well, you could replace the functional F.relu with the nn.ReLU module and register the hook to it.
If not, you can store the activation output of F.relu directly in e.g. a dict inside the forward method.


Do you have an example of "storing the activation output of F.relu directly in e.g. a dict inside the forward method"? I guess you have already explained it somewhere?

This should work:

import torch
import torch.nn as nn
import torch.nn.functional as F

act = {}

class MyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(1, 1)
        self.fc2 = nn.Linear(1, 1)

    def forward(self, x):
        x = F.relu(self.fc1(x))
        act['fc1.relu'] = x.clone()  # store the activation right after F.relu
        x = self.fc2(x)
        return x

model = MyModel()
x = torch.randn(1, 1)

print(act)
# {}

out = model(x)
print(act)
# {'fc1.relu': tensor([[0.3142]], grad_fn=<CloneBackward0>)}

Instead of getting the output of an intermediate layer, is there a way to get the input of an intermediate layer? For example, if my forward function looks like this, where fc1 and fc2 are just linear layers:

def forward(self, x, x1):
    x = F.relu(self.fc1(x))
    x_cat = torch.cat((x, x1))
    x = self.fc2(x_cat)
    return x

How can I get x_cat in this case? Thank you!
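Not from the original thread, but as a hedged sketch: a forward hook also receives the module's inputs, so hooking fc2 and storing input[0] would capture x_cat:

inputs = {}

def get_input(name):
    # the `inp` argument of a forward hook is a tuple of the module's inputs
    def hook(module, inp, output):
        inputs[name] = inp[0].detach()
    return hook

model.fc2.register_forward_hook(get_input('fc2_in'))
out = model(x, x1)
print(inputs['fc2_in'])  # this tensor is x_cat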

Hi! Is there any possibility to do the same with the YOLOv7 architecture? I'd like to get the feature maps of each layer using a yolov7 model (below is an example of the model layers).

AutoShape(
  (model): Model(
    (model): Sequential(
      (0): Conv(
        (conv): Conv2d(3, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
        (bn): BatchNorm2d(32, eps=0.001, momentum=0.03, affine=True, track_running_stats=True)
        (act): SiLU()
      )

@ptrblck

Thanks!

Yes, my previously posted code snippet should work on any model which properly registers its submodules.
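A rough sketch of that idea for a detection model (assuming get_activation from above, a loaded model instance, and an input image tensor img; module names depend on the specific repo):

import torch.nn as nn

# register the same hook on every conv layer to collect all feature maps
for name, module in model.named_modules():
    if isinstance(module, nn.Conv2d):
        module.register_forward_hook(get_activation(name))

out = model(img)  # one forward pass fills the activation dict
for name, act in activation.items():
    print(name, act.shape)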


What about loading the optimizer?
Since there is a new fc layer (because of the feature extraction), we cannot load the original optimizer state, right?
So what can we do?

Yes, this might be the case. You could create an optimizer for the parameters excluding the new fc layer, load its state_dict, and create a separate optimizer for the newly added fc layer.
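A minimal sketch of that suggestion (the file name and the layer name model.fc are placeholders; it assumes the saved state_dict was created for exactly these backbone parameters in the same order):

import torch

# parameters of the pretrained part vs. the newly added fc layer
backbone_params = [p for n, p in model.named_parameters() if not n.startswith('fc.')]

optimizer_backbone = torch.optim.SGD(backbone_params, lr=1e-3)
optimizer_backbone.load_state_dict(torch.load('optimizer.pth'))  # assumed old state without fc

optimizer_fc = torch.optim.SGD(model.fc.parameters(), lr=1e-3)   # fresh optimizer for fc

# in the training loop, step both optimizers
optimizer_backbone.step()
optimizer_fc.step()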

Can we implement this in the torchvision resnet50 model without creating a separate MyModel class?

Yes, you can use forward hooks in any model using nn.Modules.
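As a hedged sketch (the weights argument depends on your torchvision version; get_activation is the helper from above):

import torch
import torchvision

model = torchvision.models.resnet50(weights='DEFAULT')
model.eval()

# register the hook directly on an internal block, no subclassing needed
model.layer4.register_forward_hook(get_activation('layer4'))

x = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    out = model(x)
print(activation['layer4'].shape)  # torch.Size([1, 2048, 7, 7])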

Hello, I'm new to PyTorch and this platform, but I must say that I like it a lot and feel you guys are awesome. I wonder if there is any way to create a feature extractor for images from an intermediate layer (the SPP layer) of a YOLOv5 model with frozen weights.