Question about the reduce_scatter API in nccl.h: how should the input and output parameters be defined?

Looking at the code, line 752 has: const auto count = inputs[0].numel() / len;
ncclReduceScatter expects this count to be recvcount (the number of elements each rank receives), and the two don’t seem to match.

It’s tricky. I found this tutorial extremely helpful when I was learning scatter(): "Understand torch.scatter_()" by Yu Yang, on Medium.

Thank you for your reply. My question is about the NCCL communication library, and it may be different from yours.

Yes, you’re right, sorry :)
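
Coming back to the original question, here is a rough CPU-only sketch of what reduce-scatter does with its buffers. It may help to check the count arithmetic. It does not call NCCL at all. The function names simulate_reduce_scatter_sum and recvcount_from_send are made up for illustration. It relies on the documented NCCL contract that sendbuff holds at least nranks * recvcount elements and that recvbuff on rank i receives the i-th block of recvcount reduced elements. It also assumes, without having confirmed it against the code in question, that inputs[0] is the full send buffer and len is the number of ranks.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// CPU-only simulation of reduce-scatter with a sum reduction.
// This is NOT the NCCL API; it only mimics the buffer layout.
// sendbufs[r] stands in for rank r's sendbuff (nranks * recvcount elements).
// The result at index r stands in for rank r's recvbuff (recvcount elements).
std::vector<std::vector<float>> simulate_reduce_scatter_sum(
    const std::vector<std::vector<float>>& sendbufs, std::size_t recvcount) {
  const std::size_t nranks = sendbufs.size();
  std::vector<std::vector<float>> recvbufs(
      nranks, std::vector<float>(recvcount, 0.0f));
  for (std::size_t src = 0; src < nranks; ++src) {
    // Every rank contributes a full-size send buffer.
    assert(sendbufs[src].size() == nranks * recvcount);
    for (std::size_t dst = 0; dst < nranks; ++dst) {
      for (std::size_t i = 0; i < recvcount; ++i) {
        // Rank dst ends up with block dst of the element-wise sum.
        recvbufs[dst][i] += sendbufs[src][dst * recvcount + i];
      }
    }
  }
  return recvbufs;
}

// Hypothetical helper mirroring the quoted expression
// inputs[0].numel() / len, assuming len is the number of ranks.
std::size_t recvcount_from_send(std::size_t send_numel, std::size_t nranks) {
  assert(nranks > 0 && send_numel % nranks == 0);
  return send_numel / nranks;
}
```

Under those assumptions, dividing the send buffer's element count by the number of ranks gives exactly recvcount, so the expression on line 752 would agree with what ncclReduceScatter expects. If inputs[0] were instead a single per-rank output chunk, the division would be wrong, so it is worth checking which tensor is actually passed in.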