I think you would have to define the weight decay manually, e.g. as shown here, and filter out the parameters, which should not be added to weight decay.
1 Like
I think you would have to define the weight decay manually, e.g. as shown here, and filter out the parameters, which should not be added to weight decay.