Fastest way to load vectors on-the-fly for training

Yes exactly, what I do is that I shuffle the file load order between each epoch and in my case it’s good enough. If you want something closer to fully random data shuffling, you could shuffle the big tensor just after you load it from the file (following Shuffling a Tensor ). I think it should be sufficient.