How are folks using iterable datasets with DDP? The example for splitting an IterableDataset
across workers (or DDP processes) seems a little silly – if I had random access to my dataset (iter_start
), I wouldn’t be using an iterable dataset in the first place.
Has anyone come across / built a better solution?