How to implement siamese network?

melody-rain · April 26, 2017, 10:16am

Below is my network:

class Siamese(nn.Module):
    def __init__(self):
        super(Siamese, self).__init__()
        self.cnn1 = nn.Sequential(
            nn.Conv2d(1, 20, kernel_size=5),
            nn.MaxPool2d(2, stride=2),
            nn.Conv2d(20, 50, kernel_size=5),
            nn.MaxPool2d(2, stride=2))

        self.fc1 = nn.Sequential(
            nn.Linear(50 * 4 * 4, 500),
            nn.ReLU(inplace=True),
            nn.Linear(500, 10),
            nn.Linear(10, 2))

        # self.cnn2 = self.cnn1
        # self.fc2= self.fc1

        # print self.cnn1 is self.cnn2

    def forward(self, input1, input2):
        output1 = self.cnn1(input1)
        output1 = output1.view(output1.size()[0], -1)
        output1 = self.fc1(output1)

        output2 = self.cnn1(input2)
        output2 = output2.view(output2.size()[0], -1)
        output2 = self.fc1(output2)
        # print output1 - output2
        output = torch.sqrt(torch.sum((output1 - output2) * (output1 - output2), 1))

        return output1, output2, output

But it crashed when training with the error:

*** Error in `python': free(): invalid next size (fast): 0x00000000042b8d50 ***
Aborted (core dumped)

I have no idea what happens. It seems my network definition is not correct.

apaszke · April 26, 2017, 4:09pm

It’s a bug. Can you please tell me what input sizes are you using?

melody-rain · May 1, 2017, 1:01am

Input size is (28L, 28L) since I am using MNIST dataset.
I have to fix the seed of random number generator to reproduce.

BTW, I use

criterion = nn.HingeEmbeddingLoss()

as the loss function.

apaszke · May 1, 2017, 4:12pm

Can you please upload your sript in a GitHub gist, so I could try to take a look and reproduce the issue? Thanks!

melody-rain · May 2, 2017, 2:25am

Hi.
After upgrading to 0.1.12_1, I got new error.

Segmentation fault (core dumped)

The previous error disappeared.

Please check https://github.com/melody-rain/siamese-network for the code.

li_bo · May 4, 2017, 10:30am

Hi,
Can you please give an example that use Siamese network ? I googled pytorch Siamese but got no worked examples.

Thanks

smth · May 5, 2017, 2:19am

@melody it is because of NaN values appearing in your inputs to MaxPooling. If your training is stable and you dont get nans, then it will work fine. Regardless, I have fixed this in master via: https://github.com/pytorch/pytorch/commit/a6876a4783ce3d1bb3c6ba69f54c31983097ed17

hadikazemi · July 4, 2017, 3:43pm

@li_bo you can find an example here: https://github.com/hadikazemi/Machine-Learning/tree/master/PyTorch

Marvin · July 18, 2017, 4:28am

Did you fix the problem in the end? BTW, I try to change your code with cuda. But I am not able to reproduce the invalid next size error or the segmentation fault. The code in the repo seems fine to me except with unstable training, which makes senses. I guess @smth have fixed the stability issue. But current whl on pytorch.org does not have this fix, isn’t it @smth?

Marvin · July 18, 2017, 4:29am

I’ve met something similar. Could you take a look at this Possible data parallel memory leak for siamese network ?

hadikazemi · July 31, 2017, 3:12pm

You can find a short tutorial here:
https://github.com/hadikazemi/Machine-Learning/tree/master/PyTorch