I want to use a scale pyramid to extract resize invariant features for an image similarity project. I’m using resnet18 trained with triplet loss. I want to extract features at multiple scales and average pool them.
But when I pass the image tensor through kornia.geometry.transform ScalePyramid, it is returning images with shape B,C,NL,H,W.
I’m not sure how to understand the NL dimension. I need tensors in the shape B,C,H,W. How can I drop this NL dimension?
I also want to train a network that is invariant to scale. Can I use this scale pyramid during training? If yes, how to accommodate the NL dimension.