Optimal data-reading for ImageNet from scratch

What’s the best way to format and train ImageNet from scratch? The official tutorial for ImageNet is really slow. Tensorflow suggests tfrecords, what’s the analogous approach for PyTorch? Does anyone have a standalone tutorial/example for doing this?

Thanks in advance,
Spandan