Question about using the reduce_scatter API in nccl.h: how should the input and output parameters be defined?
Looking at the code, line 752 computes const auto count = inputs[0].numel() / len;
ncclReduceScatter, however, expects this argument to be the receive count (recvcount in nccl.h), i.e. the number of elements each rank receives. The two don’t seem to match.
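For reference, here is a minimal host-side sketch of the buffer layout that ncclReduceScatter documents: each rank's send buffer holds nranks * recvcount elements, and each rank receives recvcount reduced elements. This is plain C++ that models the semantics and does not call NCCL; reduce_scatter_sum and recv_count_from_send are hypothetical names made up for illustration. It also assumes that inputs[0] in the quoted line is the full send buffer and that len is the number of ranks, which the quoted line alone does not confirm. Under that assumption, dividing the send element count by the number of ranks gives exactly recvcount.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Prototype from nccl.h, shown for reference only (not called here):
//   ncclResult_t ncclReduceScatter(const void* sendbuff, void* recvbuff,
//                                  size_t recvcount, ncclDataType_t datatype,
//                                  ncclRedOp_t op, ncclComm_t comm,
//                                  cudaStream_t stream);

// Hypothetical helper: derive the per-rank receive count from the size of
// one rank's send buffer. Assumes the send buffer splits evenly across ranks.
std::size_t recv_count_from_send(std::size_t send_numel, std::size_t nranks) {
  assert(nranks > 0 && send_numel % nranks == 0);
  return send_numel / nranks;
}

// Host-side model of a sum reduce-scatter (illustration only, not NCCL).
// send[r] is rank r's send buffer and must hold nranks * recvcount elements.
// The result recv[r] holds the recvcount elements that rank r ends up with:
// the element-wise sum over all ranks of chunk r of their send buffers.
std::vector<std::vector<float>> reduce_scatter_sum(
    const std::vector<std::vector<float>>& send, std::size_t recvcount) {
  const std::size_t nranks = send.size();
  std::vector<std::vector<float>> recv(
      nranks, std::vector<float>(recvcount, 0.0f));
  for (std::size_t src = 0; src < nranks; ++src) {
    assert(send[src].size() == nranks * recvcount);
    for (std::size_t dst = 0; dst < nranks; ++dst) {
      for (std::size_t i = 0; i < recvcount; ++i) {
        // Chunk dst of every send buffer is reduced into rank dst's output.
        recv[dst][i] += send[src][dst * recvcount + i];
      }
    }
  }
  return recv;
}
```

With two ranks and send buffers of four elements each, recv_count_from_send(4, 2) yields 2, and each rank receives two summed elements. If the caller's count were instead the total send size, the call would read past the end of the send buffer, so it is worth checking which of the two the surrounding code actually passes.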
It’s tricky. I found this tutorial extremely helpful when I was learning scatter(): “Understand torch.scatter_()” by Yu Yang, on Medium.
Thank you for your reply. My question is about the NCCL communication library, though, so it may be a different topic from the one you are describing.
Yes, you’re right, sorry