Custom dataset getitem return label as integer or tensor, return single data or a range of data?

Haizhuolaojisite · March 9, 2022, 3:10am

The custom dataset will return image in tensor and its label. My questions are:

What is the data format of label class? If return label as a tensor, which one is correct:

class_id = torch.tensor(class_id) --->dataloader return label size of [batch]
or 
class_id = torch.tensor([class_id])--->dataloader return label size of [batch, 1],here 1 is dimension of label

Can getitem method return a range of data points? I know dataset[0] return first element, but is dataset[2:10] feasible in custom dataset and dataloader? If feasible, how?

Can anyone help me if possible please? Thanks a lot! I look forward to any reply!!

eqy · March 9, 2022, 8:21am

I’m not sure it is expected that the label (if it is a class id) would be transformed to a tensor; see the canonical ImageFolder implementation here:
torchvision.datasets.folder — Torchvision main documentation

If I remember correctly [batch] should be fine and is expected for standard loss functions such as cross entropy loss: CrossEntropyLoss — PyTorch 1.10 documentation (see docs saying labels should have shape (N)).

Yes, potentially if __getitem__ implemented slicing e.g., python - Implementing slicing in __getitem__ - Stack Overflow. But for common use cases slicing is not needed or desirable since the typical approach is to rely on dataloaders torch.utils.data — PyTorch 1.10 documentation to automatically batch (and shuffle) your data.

Brando_Miranda · September 27, 2022, 10:39pm

this is what I’m doing. Isn’t it standard?

        # self.data, self.targets = self._load_data()
        del self.datasets
        self.target_transform = lambda data: torch.tensor(data, dtype=torch.int)

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx: int) -> tuple[Tensor, Tensor]:
        x = self.data[idx]
        y = self.indices_to_labels[idx]
        if self.target_transform is not None:
            y = self.target_transform(y)
        return x, y