With reference to the above diagram, we have a use-case where we are creating separate datapipes for each label_classes
.
Let’s say we have an Image101.jpg
Now this Image101.jpg has been annotated with label_cls_1
and label_cls_2
i.e.,
Image101.jpg will be in result-set of DB query for both datapipe_1
and datapipe_3
as well.
Now, How can we make sure that after random_split()
Image101.jpg would come either under TRAIN set or TEST set for both datapipe_1
and datapipe_3
i.e.,
If Image101.jpg comes in test_datapipe_1
after random_shuffle()
then, it should also come under test_datapipe_3
and not train_datapipe_3
What would be the recommended way for this?