Lets say I have datasets A and B. I want to receive at runtime batches that are either from A or from B, (not mixed batches) and also their origin. So some tuple like (‘A’, BatchFromA).
How to implement that?
To get the dataset name in each sample, you could return it in the __getitem__
of each Dataset
.
A custom sampler might be the best approach to create batches from unique datasets and to make sure they are not mixed.