in the module, you’re registering a tensor logit_parameter, i.e. a dependent entity, parameters that create it are invisible to the optimizer (not in model.parameters())
in the module, you’re registering a tensor logit_parameter, i.e. a dependent entity, parameters that create it are invisible to the optimizer (not in model.parameters())