How to implement many filters/weights in a single Conv layer?

Hi, I’m trying to make a CNN model that uses custom filters/weights. I started from a pretrained model and changed it according to my needs (see the figure below for a better explanation of the idea). The goal is to take a 3-channel image and then filter the input with all the filters in each layer. I want to ask whether the weight implementation is done right. Here is the code I have so far; I know it might not be clean, sorry for that.

import torch
import torch.nn as nn
from torchvision import models

model = models.alexnet(pretrained=True)
# replace avgpool:
class Identity(nn.Module):
    def __init__(self):
        super(Identity, self).__init__()

    def forward(self, x):
        return x

model.avgpool = Identity()
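# (note: recent PyTorch versions also provide nn.Identity, which would work here as well)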

# remove the later layers: replace features[8] with a MaxPool2d and keep only features[0:9]
model.features[8] = nn.MaxPool2d(3, stride=2, padding=0, dilation=1, ceil_mode=False)
model.features = nn.Sequential(*[model.features[i] for i in range(9)])
model.classifier = nn.Sequential(nn.Dropout(p=0.5),
                                 nn.Linear(3, 3),
                                 nn.ReLU(inplace=True),
                                 nn.Linear(3, 3))
############ 1st hidden layer:
# replace Conv2d:
model.features[0] = nn.Conv2d(3, 1, 3, stride=1, groups=1)
# change weights:
model.features[0].weight = nn.Parameter(data=torch.tensor([
                       [[1, 2, 1],
                        [0, 0, 0],
                        [-1, -2, -1]],

                       [[-1, 0, 1],
                        [-2, 0, 2],
                        [-1, 0, 1]],

                       [[2, 1, 0],
                        [1, 0, -1],
                        [0, -1, -2]],

                       [[0, -1, -2],
                        [1, 0, -1],
                        [2, 1, 0]]], dtype=torch.float32), requires_grad=True)
############ 2nd hidden layer:
# replace 2nd Conv2d:
model.features[3] = nn.Conv2d(3, 1, 3, stride=1, groups=1)
# add weights:
model.features[3].weight = nn.Parameter(data=torch.tensor([
                     [[0.081, 0.17789, 0.081],
                      [0, 0, 0],
                      [-0.081, -0.17789, -0.081]],

                     [[0.081, 0, -0.081],
                      [0.17789, 0, -0.17789],
                      [0.081, 0, -0.081]],

                     [[0.17789, 0.081, 0],
                      [0.081, 0, -0.081],
                      [0, -0.081, -0.17789]],

                     [[0, -0.081, -0.17789],
                      [0.081, 0, -0.081],
                      [0.17789, 0.081, 0]]], dtype=torch.float32), requires_grad=False)
############ 3rd hidden layer:
# replace 3rd Conv2d:
model.features[6] = nn.Conv2d(3, 1, 3, stride=1, groups=1)
# add weights:
model.features[6].weight = nn.Parameter(data=torch.tensor([
          [[0.0455, -0.1789, 0.0455],
           [0.1, -0.388, 0.1],
           [0.0455, -0.1789, 0.0455]],

          [[0.0455, 0.1, 0.0455],
           [-0.1789, -0.388, -0.1789],
           [0.0455, 0.1, 0.0455]],

          [[0.1272, 0, -0.1272],
           [0, 0, 0],
           [-0.1272, 0, 0.1272]],

          [[-0.1272, 0, 0.1272],
           [0, 0, 0],
           [0.1272, 0, -0.1272]]], dtype=torch.float32), requires_grad=False)

The model.features[0].weight parameter has a shape of [4, 3, 3], so one dimension is missing.
nn.Conv2d defines the weight parameter as [out_channels, in_channels, height, width].
Based on the current code, I guess that the number of in_channels is missing?
If that’s the case, you could unsqueeze the tensor in dim 1 and repeat it 3 times.
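
Something like this (a minimal sketch, assuming the same 2D kernel should be shared across all 3 input channels):

w = model.features[0].weight.data      # current tensor of shape [4, 3, 3]
w = w.unsqueeze(1).repeat(1, 3, 1, 1)  # -> [4, 3, 3, 3] = [out_channels, in_channels, kH, kW]
model.features[0].weight = nn.Parameter(w, requires_grad=True)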

Yes @ptrblck, in_channels was indeed missing and the weight implementation was wrong, but it now works like this:

model.features[0].weight = nn.Parameter(data=torch.tensor([
                        [[[1, 2, 1], [0, 0, 0], [-1, -2, -1]],
                         [[1, 2, 1], [0, 0, 0], [-1, -2, -1]],
                         [[1, 2, 1], [0, 0, 0], [-1, -2, -1]]],
                                          ############## end of filter n°1
                        [[[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]],
                         [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]],
                         [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]],
                                          ############## end of filter n°2
                        [[[2, 1, 0], [1, 0, -1], [0, -1, -2]],
                         [[2, 1, 0], [1, 0, -1], [0, -1, -2]],
                         [[2, 1, 0], [1, 0, -1], [0, -1, -2]]],
                                          ############## end of filter n°3
                        [[[0, -1, -2], [1, 0, -1], [2, 1, 0]],
                         [[0, -1, -2], [1, 0, -1], [2, 1, 0]],
                         [[0, -1, -2], [1, 0, -1], [2, 1, 0]]]], dtype=torch.float32), requires_grad=True)
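
A quick shape check confirms the layout now matches nn.Conv2d’s [out_channels, in_channels, height, width] convention:

print(model.features[0].weight.shape)  # torch.Size([4, 3, 3, 3])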

I just encountered a new error since changing the Linear layers:

mat1 dim 1 must match mat2 dim 0

If you could suggest any solution that might work, it would be appreciated.

This error might be raised if the number of input features of an activation doesn’t match the expected in_features of a linear layer.
Try to print the shape of the flattened activation before feeding it into the linear layer and make sure the features match.
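
For example (a rough sketch of that debugging step; the 224×224 dummy input size is an assumption, adjust it to your real data):

x = torch.randn(1, 3, 224, 224)  # assumed dummy input size
out = model.features(x)
out = torch.flatten(out, 1)      # flatten all dimensions except the batch dimension
print(out.shape)                 # e.g. torch.Size([1, N])
# the first linear layer would then need nn.Linear(N, ...) instead of nn.Linear(3, 3)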