Loading VOC 2012 dataset with Dataloaders

Gaurav_Pandey · March 1, 2017, 7:30am

VOC 2012 dataset consists of images and their corresponding segmentation maps. I want to apply similar transforms to both the image and its segmentation map while loading. Any suggestions about how to proceed for this task?

apaszke · March 1, 2017, 11:50am

We don’t have a ready solution implemented, but there has been some discussion in a torchvision issue.

ncullen93 · March 1, 2017, 3:06pm

@Gaurav_Pandey you can easily adapt a dataset to handle co_transforms in the __call__ function (e.g. see this gist which has general structure that handles co_transforms):

gist.github.com

https://gist.github.com/ncullen93/14b458cb4bd237bab2a41a185f710808

co_classes.py

"""
Custom datasets from both in-memory and out-of-memory data
"""

import torch.utils.data as data

from PIL import Image
import os
import os.path

This file has been truncated. show original

and here are some relevant affine transforms to actually use – you’ll see the transforms must take in two arguments for the input and target images:

gist.github.com

https://gist.github.com/ncullen93/425ca642955f73452ebc097b3b46c493

affine_transforms.py

"""
Affine transforms implemented on torch tensors, and
only requiring one interpolation

Included:
- Affine()
- AffineCompose()
- Rotation()
- Translation()
- Shear()

This file has been truncated. show original

Gaurav_Pandey · March 2, 2017, 6:49am

Thanks guys. That was very helpful.

bodokaiser · March 22, 2017, 10:03am

I also implemented a dataset for VOC2012.

isalirezag · July 9, 2018, 10:21pm

@apaszke does pytorch have any plan to add VOC dataset like what it has for coco?

HT_Wang · April 26, 2020, 11:42pm

Now Torchvision has VOC dataset implemented, but it’s not that user friendly. Specifically, it doesn’t allow co-transform (data augmentation such as rotation and scaling) and separate transforms (normalization on images) to exist at the same time (raise an error here). Many good third-party implementations like this one.