Use pre trained resnet101 for regression data

anotherone_one · January 13, 2021, 3:25pm

Hello everyone!
I wanted to use the resnet101 for a regression like problem. So, the input of the network is a image with an associated target (a number), and I want to get an output by training a model like regression.

What I am doing is adding a linear layer in the end of the resnet101 so the output if a single value.

model = models.resnet101(pretrained=True)

num_ftrs = model.fc.in_features

model.fc = nn.Linear(num_ftrs, 1)

Does this make sense? How would you suggest to use resnet or other pre trained cnn for continuous data?

DAJINZI01 · January 14, 2021, 4:13am

I have the same problem about how to use a pre-trained model to my own work.

anotherone_one · January 14, 2021, 9:17am

my main question is basically how to use the pre trained resnet for continous data instead of categorical data! For categorical data you do what I put above and the linear layer is (num_ftrs, number of classes). Then, when extracting the prediction in the test phase you can use a softmax to get the probability of each class!

VirginieBfd · January 14, 2021, 2:50pm

This solution looks correct.

model = models.resnet101(pretrained=True)
num_ftrs = model.fc.in_features
model.fc = nn.Linear(num_ftrs, 1)

Ensure you change you use a loss made for regression problems such as torch.nn.MSELoss.

You can use a network pre-trained on a classification problem and transfer it to a regression problem. You might need to unfreeze the last blocks to adapt it to your application.

The first blocks from ResNet or another net, will detect features such as edges, forms, which is mostly invariant when changing datasets.

Hope this helps!

anotherone_one · January 15, 2021, 1:50pm

Thank you !! How can I unfreeze the last blacks ?

VirginieBfd · January 15, 2021, 1:57pm

Something like:

for param in model.parameters():
     param.requires_grad = False # False when you freeze the layer / True when you want to train it

anotherone_one · January 15, 2021, 2:11pm

Thank you! And other thing, why I need to unfreeze the last blocks? I think I am missing something here, like, theoretically I don’t understand very well why it is done like this.

VirginieBfd · January 15, 2021, 2:16pm

This tutorial gives both the explanations and the example with ResNet:
https://towardsdatascience.com/transfer-learning-picking-the-right-pre-trained-model-for-your-problem-bac69b488d16

anotherone_one · January 15, 2021, 2:19pm

Thank you so much for the help!!