Long training time, bottleneck output shows high IO

If the data loading is the bottleneck, have a look at this post by @rwightman, which explains how these bottlenecks might be avoided.