Siamese dataset Builder for large datasets

Hi,

Disclaimer: I’m new to pytorch.

I’ve been working on building a parallel data loader section (such that only the batchSize images from the workers are loaded onto memory) that feeds into a siamese network.

This is what I have so far:
https://gist.github.com/arBest/811f7df58c50495873d5eebc2c348552

Question:
I wanna go from
dst = SiameseDataset(pos_pairs_csv_path)

to
dst = SiameseDataset(pos_pairs_csv_path, neg_pairs_csv_path)

where if my batchSize = N, then batchSize = N/2 comes from pos_pairs file and N/2 comes from neg_pairs file.

Thanks in advance for the help!

Wrote something that iterates through pos and neg datasets alternatively.
Thanks,