Pipe Pretrained Model to Custom Layers

Paran0 · January 27, 2020, 11:48pm

Consider the following code, which prints the child modules of the torchvision Faster R-CNN model.

import torchvision

model_fastercnn = torchvision.models.detection.fasterrcnn_resnet50_fpn(pretrained=True)
modules_2 = list(model_fastercnn.children())
for idx, module in enumerate(modules_2):  # idx avoids shadowing the built-in iter()
    print("###### {} ######".format(idx))
    print(module)

Here is the last module printed by the code above.

###### 3 ######
RoIHeads(
  (box_roi_pool): MultiScaleRoIAlign()
  (box_head): TwoMLPHead(
    (fc6): Linear(in_features=12544, out_features=1024, bias=True)
    (fc7): Linear(in_features=1024, out_features=1024, bias=True)
  )
  (box_predictor): FastRCNNPredictor(
    (cls_score): Linear(in_features=1024, out_features=91, bias=True)
    (bbox_pred): Linear(in_features=1024, out_features=364, bias=True)
  )
)

Now let’s say we want to use the output of “fc7” (RoIHeads->box_head->fc7) as the input to custom layers, for example a single Linear 1024 -> 512. We also want to keep the pre-trained weights up to fc7 and train only the weights of the newly added layer for some loss function. Since the last module is wrapped in the RoIHeads class, is there any way to do this?

I bring this up because, for example, with resnet152 I learned by printing the modules that it contains “9” layers and that the last one is an FC layer. There I omitted the last layer, built a new Sequential network from the remaining ones, added another layer, and trained the model using only the parameters of that last layer.

ptrblck · January 28, 2020, 4:30am

For a quick experiment, I would register a forward hook on this particular layer, store the output activations, and reuse them in another model outside of this FasterRCNN model.

However, if you want to properly change the workflow, I would recommend deriving a custom model and adapting the forward methods as needed.
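The quick-experiment route described above can be sketched roughly as follows. To keep it small and runnable without downloading weights, `TinyBoxHead` is a hypothetical stand-in for `model.roi_heads.box_head` (same `fc6`/`fc7` names, reduced input size), and the random data, MSE loss, and SGD settings are assumptions chosen only for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TinyBoxHead(nn.Module):
    """Hypothetical stand-in for the detector's box_head (fc6 -> fc7)."""

    def __init__(self, in_features=64, hidden=1024):
        super().__init__()
        self.fc6 = nn.Linear(in_features, hidden)
        self.fc7 = nn.Linear(hidden, hidden)

    def forward(self, x):
        x = F.relu(self.fc6(x.flatten(start_dim=1)))
        return F.relu(self.fc7(x))


torch.manual_seed(0)
pretrained = TinyBoxHead()  # in practice: the pre-trained detector
pretrained.eval()
for p in pretrained.parameters():
    p.requires_grad_(False)  # keep the pre-trained weights fixed

activation = {}


def save_output(name):
    # Returns a hook that stores the layer's output under the given name.
    def hook(module, inputs, output):
        activation[name] = output.detach()
    return hook


handle = pretrained.fc7.register_forward_hook(save_output("fc7"))

# New layer trained outside the frozen model: Linear 1024 -> 512.
new_head = nn.Linear(1024, 512)
optimizer = torch.optim.SGD(new_head.parameters(), lr=0.01)

x = torch.randn(8, 64)        # placeholder input batch
target = torch.randn(8, 512)  # placeholder regression target

with torch.no_grad():
    pretrained(x)  # the forward pass fills activation["fc7"]

out = new_head(activation["fc7"])
loss = F.mse_loss(out, target)
optimizer.zero_grad()
loss.backward()
optimizer.step()

handle.remove()  # detach the hook once it is no longer needed
```

Note that a hook on a Linear layer captures its raw output. When the ReLU is applied functionally inside the parent module's forward, as in this stand-in, the stored values are pre-activation, so apply the ReLU yourself if the post-activation features are what you need.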

Paran0 · January 28, 2020, 8:30pm

Hi, thanks for your reply.
Is there a code snippet available for doing so? For example, in the second case, when we derive a custom model and modify the layers, how would we assign the weights of the pre-trained model?

ptrblck · January 28, 2020, 10:04pm

You could use this as the base code to modify your forward method for e.g. resnet50:

import torch
from torchvision import models


class MyResnet50(models.resnet.ResNet):
    def __init__(self, pretrained=False):
        # Pass default resnet50 arguments to super init
        # https://github.com/pytorch/vision/blob/e130c6cca88160b6bf7fea9b8bc251601a1a75c5/torchvision/models/resnet.py#L260
        super(MyResnet50, self).__init__(models.resnet.Bottleneck, [3, 4, 6, 3])
        if pretrained:
            self.load_state_dict(models.resnet50(pretrained=True).state_dict())

    def _forward_impl(self, x):
        # See note [TorchScript super()]
        x = self.conv1(x)
        x = self.bn1(x)
        x = self.relu(x)
        x = self.maxpool(x)

        x = self.layer1(x)
        x = self.layer2(x)
        x = self.layer3(x)
        x = self.layer4(x)

        x = self.avgpool(x)
        x = torch.flatten(x, 1)
        x = self.fc(x)

        return x

    def forward(self, x):
        return self._forward_impl(x)


model = MyResnet50(pretrained=True)
x = torch.randn(2, 3, 224, 224)
output = model(x)
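Building on the same subclassing pattern, one way to add a new trainable layer while keeping the pre-trained weights fixed might look like the sketch below. `Base` is a hypothetical stand-in for a pre-trained network, and the layer sizes and optimizer settings are assumptions. The main point is the ordering: load the state dict before registering the new layer, so the keys match exactly, then freeze everything that was loaded.

```python
import torch
import torch.nn as nn


class Base(nn.Module):
    """Hypothetical stand-in for a pre-trained network."""

    def __init__(self):
        super().__init__()
        self.features = nn.Linear(16, 1024)
        self.fc = nn.Linear(1024, 10)

    def forward(self, x):
        return self.fc(torch.relu(self.features(x)))


class Extended(Base):
    def __init__(self, pretrained_state=None):
        super().__init__()
        if pretrained_state is not None:
            # Load first: at this point the keys match the base model exactly.
            self.load_state_dict(pretrained_state)
        # Freeze everything that exists so far (the pre-trained part).
        for p in self.parameters():
            p.requires_grad_(False)
        # Registered after freezing, so it stays trainable by default.
        self.new_fc = nn.Linear(1024, 512)

    def forward(self, x):
        x = torch.relu(self.features(x))
        return self.new_fc(x)  # bypasses the original fc layer


torch.manual_seed(0)
source = Base()  # plays the role of the pre-trained model
model = Extended(source.state_dict())

# Only the new layer's parameters are handed to the optimizer.
trainable = [n for n, p in model.named_parameters() if p.requires_grad]
optimizer = torch.optim.SGD(
    (p for p in model.parameters() if p.requires_grad), lr=0.01
)

out = model(torch.randn(4, 16))
print(trainable)
```

If the frozen part contains batch norm layers, as ResNet does, setting `requires_grad` to False does not stop their running statistics from updating in training mode, so those layers may also need to be kept in eval mode.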

Mhmmd · August 20, 2021, 11:04am

I want to get output from layer fc6 (with dimensions of 1024). How can I do this?
Please guide me with a sample code

ptrblck · August 20, 2021, 6:36pm

You could use a forward hook as described here and register the hook on the fc6 layer.

Mhmmd · August 27, 2021, 6:35pm

I used this code, but the output I receive covers all 1000 proposal boxes, while I need only the features of the selected boxes. How can I get them?

activation = {}
def get_activation(name):
    def hook(model, input, output):
        activation[name] = output.detach()
    return hook
model.roi_heads.box_head.fc6.register_forward_hook(get_activation('fc6'))

model.eval()
FEATS = []
inputs = list_img
features = {}
preds = model(inputs)
FEATS.append(activation['fc6'].cpu().numpy())

ptrblck · August 27, 2021, 8:16pm

I don’t know where the boxes are selected, but in case it’s done by a subsequent layer, you might want to register the hook on that one instead.