[NEED HELP] RuntimeError: Expected to have finished reduction in the prior iteration before starting a new one

My first suspicion was probably this, but it does look like all outputs are participating in loss computation. Although, to double check this can you share the code for PartialFC since it is used in the loss computation?