How to delete layer in pretrained model?

EonianSky · May 7, 2018, 8:09am

I am using the pre-trained model of vgg16 through torchvision. Now I don’t need the last layer (FC) in the network. How should I remove it?

ptrblck · May 7, 2018, 8:27am

Don’t you need this layer at all, i.e. do you want to get the avgpool back from your model?
If so, you could createn a module returning its input:

class Identity(nn.Module):
    def __init__(self):
        super(Identity, self).__init__()
        
    def forward(self, x):
        return x


model = models.resnet18(pretrained=False)
model.fc = Identity()
x = torch.randn(1, 3, 224, 224)
output = model(x)
print(output.shape)

From the code it looks like you are using a resnet, so I used it in my examples.

However, usually you would like to use another linear layer with your number of classes as its output.

model.fc = nn.Linear(512, num_classes)

EonianSky · May 7, 2018, 8:41am

Thank you very much.
PS:In my example, what is needed is the output of avgpool, no need for FC at all.

EonianSky · May 7, 2018, 9:17am

I found I put a wrong figure… If I use vgg16 and want to delete some layers,what should I do?

ptrblck · May 7, 2018, 9:24am

In this case you could use the following code:

model.classifier = nn.Sequential(*[model.classifier[i] for i in range(4)])
print(model.classifier)

EDIT:
Alternatively, you can also call .children, since the range indexing might be cumbersome for large sequential models.

model.classifier = nn.Sequential(*list(model.classifier.children())[:-3])

cad8bd801dfbc87c5d0f · August 9, 2018, 4:12am

Sorry but why did my model say ‘ResNet’ object has no attribute ‘classifier’

lxtGH · August 9, 2018, 4:22am

Try to use .fc attribute Because Resnet doesn’t have classifier (with dropout in it), the above is vgg net @cad8bd801dfbc87c5d0f

Shihab_Shahriar · January 7, 2019, 3:06pm

using this approach however, I found specifying different learning for different layers in optimizer quite hard, which is very common in transfer learning. Can you please show how to achieve that in simpler ways? Thanks

ptrblck · January 7, 2019, 5:05pm

Could you post your model architecture and which learning rates you would use for which layers?

Shihab_Shahriar · January 7, 2019, 6:38pm

In resnet18, I wanted to simply replace ‘fc’ final layer, something like:
model.fc = nn.Linear(512,10)

Now if I wanted to do:

param_groups = [
    {'params':model.fc.parameters(),'lr':.001},
    {'params':model.others.parameters(),'lr':.0001},
]
optimizer = Adam(param_groups)

Finding model.others.parameters() part seemed hard. I could probably replace it with params array below:

params = []
for child in list(model.children())[:-1]:
    params.extend(list(child.parameters()))

Or define a class overriding nn.Module, which I did in the end. I was just wondering if there’s any elegant one liner, nothing big deal Thanks.

ptrblck · January 7, 2019, 8:25pm

Your current code looks good!
If the “filtering” would be a bit complicated, you could use Python filter to remove the model.fc layer by name, but in your use case I’m not sure there is a faster or more elegant way of passing others to the optimizer.

ki2rin · January 10, 2019, 12:32am

‘classifier’ is a key of the layer.

(classifier): Sequential(
…
)

So, you can refer to a sequence of a specific layer by its key.
Perhaps, ResNet has a different key name to VGG in its implementation.
@cad8bd801dfbc87c5d0f

ghazal_sahebzamani · February 12, 2019, 11:24pm

Sorry if this question seems trivial; why can’t we simply use:
model.classifier=model.classifier[:-1]
Why is the Sequential line necessary?

ptrblck · February 12, 2019, 11:29pm

Your assignment might actually work and I’m not sure, if the nn.Sequential wrapper was necessary when I’ve written the code.
I think I just wanted to make sure the modules don’t get messed up somehow.
Anyway, I’ve tested your line of code using vgg16 and it seems to work perfectly, so you can just stick to your approach.

marwa · March 8, 2019, 2:34pm

I tried to delete only the avgpool of the resnet but i got always mismatch errors. To avoid this error I replaced
resnet.avgpool = nn.AvgPool2d(kernel_size=7, stride=1, padding=0)
by
resnet.avgpool = nn.AvgPool2d(kernel_size=1, stride=1, padding=0)
and then
resnet.fc = nn.Linear(512x7x7,number_of_classes).
It is not the optimized way to deal with the problem. Do you have a better solution please?

ptrblck · March 8, 2019, 9:01pm

nn.AvgPool2d with a kernel size of 1 acts like an identity layer, your model basically got rid of the pooling layer.
Alternatively, you could just define the resnet.avgpool as a “real” custom Identity layer:

class Identity(nn.Module):
    def __init__(self):
        super(Identity, self).__init__()
        
    def forward(self, x):
        return x

model.avgpool = Identity()
model.fc = nn.Linear(512*7*7, nb_classes)

Both approaches will yield the same result.

Ivamcoder · July 22, 2019, 10:37pm

Hi @ptrblck,

If we want to delete some sequenced layers in pretrained model, How could we do? for example in renet assume that we just want first three layers with fixed weights and omit the rest, I should put Identity for all layers I do not want? not any other way?

ptrblck · July 23, 2019, 9:41am

Not necessarily.
If you would like to keep the forward method without overriding it, replacing a few layers with nn.Identity layers might be the fastest approach.
However, if you would like to just use a few specific layers, I would recommend to override the class and write your custom model or alternatively reuse these layers in your custom model by passing them to your model.

pablo_sanchez · August 23, 2019, 8:41am

Dear
Thank you very much for your help with the layer removal information, this helped me to fine tune in squeezenet.
Regards

Saharkakavand · December 16, 2019, 5:01pm

@ptrblck
is there any way that I can remove specific filters in pretrained model on vgg16(the base of faster rcnn) ? I want to prune vgg16 filter wise, so I need to remove specific filters in a layer and as a consequence in the next layer too.
thanks