Could you please post a minimal, executable snippet that reproduces the error?
Generally that shouldn’t be the case: model parameters are leaf tensors in the computation graph of the loss, provided the loss is calculated correctly.
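In case it helps while putting the snippet together, here is a rough sanity check of the leaf-tensor point. It is only a sketch: the `nn.Linear` toy model, the random data, and the MSE loss are stand-ins I made up, not your setup.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical toy model and data, used only for illustration.
model = nn.Linear(4, 2)
x = torch.randn(8, 4)
target = torch.randn(8, 2)

# The loss is computed from the model output, so it is connected to the parameters.
loss = nn.functional.mse_loss(model(x), target)

# Parameters are created directly rather than by an operation,
# so they are leaves and have no grad_fn.
params_are_leaves = all(p.is_leaf and p.grad_fn is None for p in model.parameters())

# The loss is the result of operations on tensors that require grad, so it is not a leaf.
loss_is_leaf = loss.is_leaf

# After backward(), every parameter that contributed to the loss has .grad populated.
loss.backward()
grads_populated = all(p.grad is not None for p in model.parameters())

print(params_are_leaves, loss_is_leaf, grads_populated)
```

If the same checks give different results on your model, the step where the loss gets disconnected from the parameters is probably the place to look.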