PyTorch distributed concurrent queue/buffer

Hi everyone,

I am wondering whether there is any way, with PyTorch distributed, to build a concurrent queue (or buffer) between the parameter server and the workers.

The idea is that every worker would act as a producer and push messages into the queue.

The parameter server would act as a consumer and pop messages off the queue.

In addition, the parameter server should be able to check the current length of the queue.

Thank you!

Hey @ryuxin, can this be implemented as a wrapper on top of the RPC API? For example, can you implement the queuing logic as an RPC target function? Some related tutorials:

  1. https://pytorch.org/tutorials/intermediate/rpc_param_server_tutorial.html
  2. https://github.com/pytorch/tutorials/blob/release/1.6/intermediate_source/rpc_async_execution.rst
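To make the suggestion concrete, here is a rough, untested sketch of the queuing logic as RPC target functions, assuming a two-process setup where rank 0 is the parameter server (named "ps") and rank 1 is a worker. The names `push_msg`, `queue_len`, and `msg_queue` are placeholders I made up for illustration, not part of the RPC API:

```python
import os
import queue

import torch
import torch.distributed.rpc as rpc
import torch.multiprocessing as mp

# Lives on the parameter-server process; queue.Queue is thread-safe, so
# concurrent RPC callbacks from multiple workers can push into it safely.
msg_queue = queue.Queue()


def push_msg(msg):
    # RPC target function: workers (producers) call this remotely on "ps".
    msg_queue.put(msg)


def queue_len():
    # RPC target function: lets callers check the current backlog size.
    return msg_queue.qsize()


def run(rank, world_size):
    os.environ["MASTER_ADDR"] = "localhost"
    os.environ["MASTER_PORT"] = "29500"
    name = "ps" if rank == 0 else f"worker{rank}"
    rpc.init_rpc(name, rank=rank, world_size=world_size)

    if rank != 0:
        # Producer side: push a few messages to the queue hosted on "ps".
        for step in range(3):
            rpc.rpc_sync("ps", push_msg, args=((name, step, torch.randn(4)),))

    # Graceful shutdown waits for all outstanding RPCs on all processes.
    rpc.shutdown()

    if rank == 0:
        # Consumer side: drain the local queue (done after shutdown here only
        # to keep the demo simple; a real PS would consume while serving).
        print("queue length:", msg_queue.qsize())
        while not msg_queue.empty():
            sender, step, tensor = msg_queue.get()
            print(f"got step {step} from {sender}")


if __name__ == "__main__":
    world_size = 2
    mp.spawn(run, args=(world_size,), nprocs=world_size, join=True)
```

In a real setup, the parameter server could also call `queue_len` (locally, or via `rpc_sync` from another process) to decide when to aggregate, and the consumer loop would run in a background thread while the RPC server keeps accepting pushes.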

Thanks for the hint!