Pretraining with MobileNetv3

Hi, I am trying to use MobileNetV3 for transfer learning to classify images into 4 categories. First, I freeze the pretrained weights so I can use the network for feature extraction, and I add a custom layer at the end. However, I get the following error:

Traceback (most recent call last):
  File "trial13MobileNetxray.py", line 431, in <module>
    train(mobilenetv3, args)
  File "trial13MobileNetxray.py", line 230, in train
    loss.backward()  # backward pass (compute parameter updates)
  File "/opt/conda/lib/python3.6/site-packages/torch/tensor.py", line 245, in backward
    torch.autograd.backward(self, gradient, retain_graph, create_graph, inputs=inputs)
  File "/opt/conda/lib/python3.6/site-packages/torch/autograd/__init__.py", line 147, in backward
    allow_unreachable=True, accumulate_grad=True)  # allow_unreachable flag
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

My code is as follows:

# Obtain the pretrained MobileNet from torchvision models
mobilenetv3 = torchvision.models.mobilenet_v3_large(pretrained=True)

# Freeze the pretrained weights for feature extraction
for param in mobilenetv3.parameters():
  param.requires_grad = False

# add custom layers to prevent overfitting and for finetuning 
mobilenetv3.fc = nn.Sequential(nn.Dropout(0.2),
                              nn.BatchNorm1d(1280), #320
                              nn.ReLU(),
                              nn.Dropout(0.3),      
                              nn.Linear(320, 4),
                              nn.LogSoftmax(dim=1)
                              )

Please help


When you set the MobileNet parameters to not require gradients, the backward pass has nothing to compute gradients with, which is what raises that error. Are you trying to freeze the MobileNet layers? If so, there is a better way to do that: when you define your optimizer, pass it only the parameters you want to train, like this:

optimizer = optim.Adam(mobilenetv3.fc.parameters())

That will only update the fc parameters.
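
For completeness, here is a minimal sketch of that setup (the hidden size, dropout rate, and learning rate are illustrative, not taken from your code). Note that torchvision's MobileNetV3 stores its classification head in classifier rather than fc, so that is the module the optimizer should see:

import torch.nn as nn
import torch.optim as optim
import torchvision

# Load the pretrained backbone and freeze all of its parameters (feature extraction).
model = torchvision.models.mobilenet_v3_large(pretrained=True)
for param in model.parameters():
    param.requires_grad = False

# Replace the last layer of the head; newly created modules require gradients by default.
# The final Linear layer of the stock head takes 1280 input features.
model.classifier[3] = nn.Sequential(
    nn.Linear(1280, 256),   # hidden size 256 is illustrative
    nn.ReLU(),
    nn.Dropout(0.3),
    nn.Linear(256, 4),      # 4 output classes, as in your setup
    nn.LogSoftmax(dim=1),
)

# Give the optimizer only the new head's parameters, as described above.
optimizer = optim.Adam(model.classifier[3].parameters(), lr=1e-3)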

Hello @Dwight_Foster,

I think the part of the code he shows is not incorrect. In fact, I think this is the standard way of freezing parameters: if he doesn't set param.requires_grad = False, all gradients will be computed during the backward pass, which is not memory efficient if he doesn't want to use all of them to update the model.

He probably made a mistake when creating the optimizer and needs to create it as you said.
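
For reference, a short sketch of that fix using the variable name from the original post: build the optimizer from only the parameters that still require gradients, so the frozen backbone is skipped (the learning rate here is just an example).

import torch.optim as optim

# Only the unfrozen parameters (the new head) still have requires_grad=True.
trainable_params = [p for p in mobilenetv3.parameters() if p.requires_grad]
optimizer = optim.Adam(trainable_params, lr=1e-3)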


Yes, you are correct. Thank you for pointing that out.
