How to use torch.cuda.streams?

torch.cuda.stream(your_stream) do what you want, I guess :wink:
And you can use multiprocess to launch the different process with different streams. Note that multiprocess with cuda is only suported in Python 3