Thing is, I am bottlenecked by network so I feel like I could speed things up by not having the worker wait on running a transform before starting the next download.
Is there a convenient way to do this with the Dataset/DataLoader APIs directly?
I’d first recommend having a separate method called when init’ing the Dataset to initially download the files instead of doing it on the fly, but another work around would be to avoid downloading each file at the very least: