Mapping of old pixels to new using torchvision.transforms.functional.rotate

M_S · December 8, 2018, 10:36am

Hi,
I have my own dataset of images with per-image object labels; each label is a set of (x, y) points forming a convex polygon.
I want to implement my own augmentation.
Specifically, I want to rotate using torchvision.transforms.functional.rotate(..., expand=True).
However, I also need the transformation function so that I can apply it to my labels and get the new set of points defining each object's polygon.
Is there a way, after calling rotate, to get the transformation map from old pixels to new pixels?

Amrit_Das · December 8, 2018, 10:42am

@M_S I would suggest that, instead of using the built-in torchvision transforms, you write your own utils.py with custom transformations. You can import it and use it as needed, which should help with your problem. I'll add a link as an example:

https://github.com/kuangliu/pytorch-retinanet/blob/master/transform.py

'''Perform transforms on both PIL image and object boxes.'''
import math
import random

import torch
import torchvision.transforms as transforms

from PIL import Image, ImageDraw


def resize(img, boxes, size, max_size=1000):
    '''Resize the input PIL image to the given size.

    Args:
      img: (PIL.Image) image to be resized.
      boxes: (tensor) object boxes, sized [#ojb,4].
      size: (tuple or int)
        - if is tuple, resize image to the size.
        - if is int, resize the shorter side to the size while maintaining the aspect ratio.
      max_size: (int) when size is int, limit the image longer size to max_size.

This file has been truncated.

Give it a look. Using your own file gives you more freedom to experiment.

M_S · December 8, 2018, 10:46am

Thanks for the answer.
That would require me to work out the mapping function of a rotation with expansion myself. My question was whether such an implementation already exists in the library (it doesn’t have to be native PyTorch code, but that is preferred since the rest of my augmentations are).

Amrit_Das · December 8, 2018, 10:53am

It's very simple: you just have to create a rotation matrix:

or you can check this post for suggested code:

M_S · December 8, 2018, 12:12pm

Yes, but as I mentioned, I require expansion, so the rotation is followed by a translation.
One solution I had in mind is to first pad the original images heavily, which would let me drop the expansion, and then it is indeed an easy calculation with a rotation matrix.
However, I am still looking for an existing solution to this problem. If there is none, I’ll implement it myself…
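For reference, here is a minimal sketch of what that rotation plus translation could look like for polygon points. It assumes PIL-image inputs (where, as far as I can tell, torchvision's rotate hands off to Pillow's Image.rotate), rotation about the image centre, and the way recent Pillow versions size the expanded canvas. The function name rotate_points_expand is hypothetical and not part of any library, so check the result against a test image before relying on it.

```python
import math


def rotate_points_expand(points, angle_deg, width, height):
    """Map (x, y) points through a counter-clockwise rotation about the image
    centre onto the expanded canvas. Returns (new_points, (new_w, new_h)).

    Hypothetical helper: an assumption-based sketch, not a library function.
    """
    theta = math.radians(angle_deg)
    # Rounding keeps multiples of 90 degrees exact (Pillow rounds the same way).
    cos_t = round(math.cos(theta), 15)
    sin_t = round(math.sin(theta), 15)
    cx, cy = width / 2.0, height / 2.0

    def rotate_about_centre(x, y):
        dx, dy = x - cx, y - cy
        # Image coordinates have y pointing down, hence this sign pattern.
        return cos_t * dx + sin_t * dy, -sin_t * dx + cos_t * dy

    # The rotated corners determine the size of the expanded canvas.
    corners = [rotate_about_centre(x, y)
               for x, y in ((0, 0), (width, 0), (width, height), (0, height))]
    xs = [rx + cx for rx, _ in corners]
    ys = [ry + cy for _, ry in corners]
    new_w = math.ceil(max(xs)) - math.floor(min(xs))
    new_h = math.ceil(max(ys)) - math.floor(min(ys))

    # The translation: re-centre the rotated points on the new canvas.
    new_points = []
    for x, y in points:
        rx, ry = rotate_about_centre(x, y)
        new_points.append((rx + new_w / 2.0, ry + new_h / 2.0))
    return new_points, (new_w, new_h)


# Usage: rotate the corners of a 4x2 image by 90 degrees.
polygon = [(4, 0), (0, 0)]
rotated, size = rotate_points_expand(polygon, 90, 4, 2)
```

Points are treated as continuous coordinates here, so results can be off by a fraction of a pixel relative to the resampled image.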

Marat · April 16, 2020, 1:16pm

There is an unusual way to do this (I will try it soon): build an (x, y) coordinate grid, convert it to a PIL image, apply the reverse transform to that grid, and voilà, you get the mapping from old coordinates to new ones.

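        import torch
        import torchvision.transforms as transforms
        from PIL import Image

        # img: a square PIL image; label: a tensor whose columns 2 and 3 hold
        # x and y normalized to [0, 1] (both assumed to be defined earlier).
        w, h = img.width, img.height

        assert w == h

        # Store each pixel's normalized x in the red channel and y in the blue channel.
        grid_x = torch.arange(w).repeat(h, 1).view([h, w]).type(torch.FloatTensor) / w
        grid_y = torch.arange(h).repeat(w, 1).t().view([h, w]).type(torch.FloatTensor) / h
        dummy = torch.zeros((h, w))
        grid_xy = torch.stack([grid_x, dummy, grid_y])
        grid_xy = transforms.ToPILImage()(grid_xy)  # quantizes coordinates to 8 bits

        rot = 30
        img = img.rotate(rot, Image.NEAREST, expand=0)
        # Rotate the grid the opposite way so each old pixel position holds its new coordinates.
        grid_xy = grid_xy.rotate(-rot, Image.NEAREST, expand=0)

        grid_xy.show()
        img.show()

        grid_xy = transforms.ToTensor()(grid_xy)
        # Indices must be integer tensors; clamp so a coordinate of 1.0 stays in bounds.
        x_orig = (label[:, 2] * w).long().clamp(0, w - 1)
        y_orig = (label[:, 3] * h).long().clamp(0, h - 1)

        label[:, 2] = grid_xy[0, y_orig, x_orig]
        label[:, 3] = grid_xy[2, y_orig, x_orig]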

The oddest part is that you can apply any number of transformations whatsoever without any geometry headaches; the only thing you have to do is keep applying the inverse transformations to grid_xy in reverse order.
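To make the "reverse order" part concrete, here is a small pure-Python sketch that chains two transforms on a square grid of nested lists. The helpers hflip, rot90_ccw and rot90_cw are hypothetical stand-ins for real image operations, chosen only because they are exact and easy to check: the image is flipped and then rotated, while the coordinate grid gets the inverse rotation first and then the (self-inverse) flip.

```python
def hflip(img):
    """Mirror a square grid left to right."""
    return [row[::-1] for row in img]


def rot90_ccw(img):
    """Rotate a square grid by 90 degrees counter-clockwise."""
    n = len(img)
    return [[img[x][n - 1 - y] for x in range(n)] for y in range(n)]


def rot90_cw(img):
    """Rotate a square grid by 90 degrees clockwise (inverse of rot90_ccw)."""
    n = len(img)
    return [[img[n - 1 - x][y] for x in range(n)] for y in range(n)]


n = 4
image = [[10 * y + x for x in range(n)] for y in range(n)]  # unique value per pixel
coords = [[(x, y) for x in range(n)] for y in range(n)]     # each cell stores its own (x, y)

# Image pipeline: flip first, then rotate.
transformed = rot90_ccw(hflip(image))

# Grid pipeline: inverse of the last transform first, then inverse of the first.
lookup = hflip(rot90_cw(coords))


def new_position(x, y):
    """Return where the pixel originally at (x, y) ends up in `transformed`."""
    return lookup[y][x]
```

With real images the same idea applies, but interpolation and 8-bit quantization make the looked-up coordinates approximate rather than exact.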
Enjoy!