Multiple model outputs activate from a single model based on labels selection

I think your explanation is correct, although I’m not sure, what “closed” graph means exactly.
However, each operation in the forward pass will create a computation graph, which will be used to calculate the gradients for the involved parameters in the backward call.