How to implement dynamic or cost-aware data prefetching logic using the PyTorch DataLoader?

I am pretty new to PyTorch, but I was wondering if it is possible to implement cost-aware data prefetching in PyTorch for maximum GPU utilization. That is, not just statically setting prefetch_factor, but rather making prefetching batch- and cost-aware. Any ideas, or frameworks that have implemented this kind of thing?
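To illustrate the idea, here is a minimal, framework-agnostic sketch of what "cost-aware" prefetching could mean: a background thread fills a queue with batches, and the target queue depth is adjusted at runtime by comparing how long the consumer waits for a batch against how long batches take to load. This is a toy producer/consumer model, not the DataLoader API (DataLoader's prefetch_factor is fixed at construction); the class name and thresholds are made up for illustration.

```python
import queue
import threading
import time

class AdaptivePrefetcher:
    """Toy cost-aware prefetcher (illustrative sketch, not a PyTorch API).

    A background thread fills a queue with items; the target queue depth
    grows when the consumer waits on the loader (loading is the
    bottleneck) and shrinks otherwise, bounded by max_depth to cap memory.
    """

    def __init__(self, iterable, min_depth=1, max_depth=8):
        self._it = iter(iterable)
        self.min_depth = min_depth
        self.max_depth = max_depth
        self.depth = min_depth          # current prefetch target, adjusted live
        self._q = queue.Queue()
        self._done = object()           # sentinel marking end of data
        self._thread = threading.Thread(target=self._worker, daemon=True)
        self._thread.start()

    def _worker(self):
        while True:
            # Respect the (dynamically adjusted) target depth.
            while self._q.qsize() >= self.depth:
                time.sleep(0.001)
            t0 = time.perf_counter()
            try:
                item = next(self._it)
            except StopIteration:
                self._q.put(self._done)
                return
            load_time = time.perf_counter() - t0
            self._q.put((item, load_time))

    def __iter__(self):
        return self

    def __next__(self):
        t0 = time.perf_counter()
        got = self._q.get()
        wait_time = time.perf_counter() - t0
        if got is self._done:
            raise StopIteration
        item, load_time = got
        # Cost-aware adjustment: if the consumer had to wait relative to
        # the load cost, prefetch deeper; otherwise back off to save memory.
        if wait_time > 0.5 * load_time and self.depth < self.max_depth:
            self.depth += 1
        elif wait_time < 0.1 * load_time and self.depth > self.min_depth:
            self.depth -= 1
        return item
```

Usage would be wrapping any iterable (e.g. a DataLoader instance) and iterating as usual: `for batch in AdaptivePrefetcher(loader): ...`. A real implementation would also need to account for pinned-memory limits and CUDA stream overlap, which this sketch ignores.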

Is it something similar to this: GitHub - Rahm-no/MinatoLoader: Artifact for EuroSys'26 paper "MinatoLoader: Accelerating Machine Learning Training Through Efficient Data Preprocessing"