Hey, I have a question that might be weird.
given a model, either initialized with some weights or pretrained, I want to do the following:
- access the weights w0 of a torch.nn.Module model m
- compute w1 = f(w0, u), i.e. modify the weights based on an aux variable u
- compute the gradient w.r.t. u, namely dL(m(w1))/du
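For concreteness, here is a minimal sketch of the computation I mean, with a single linear layer written by hand instead of a real nn.Module. The additive f, the toy shapes, and the sum loss are placeholders I made up for illustration:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hand-written stand-in for the model m: one linear layer.
w0 = torch.randn(2, 3)                     # existing weight (initialized or pretrained)
b = torch.zeros(2)
u = torch.zeros(2, 3, requires_grad=True)  # aux variable


def f(w0, u):
    # Hypothetical modification, chosen only for illustration.
    return w0 + u


x = torch.randn(4, 3)
w1 = f(w0, u)                              # modified weight, still connected to u
loss = F.linear(x, w1, b).sum()            # stands in for L(m(w1))
(grad_u,) = torch.autograd.grad(loss, u)   # dL/du
print(grad_u.shape)
```

This works because w1 is used directly in the forward pass. What I'm after is the same thing when the forward pass is an arbitrary nn.Module.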
Nothing I have tried so far works: state_dict does not allow gradients to pass through, and as far as I can tell you can't assign values to the parameters in model.parameters() either. They seem to behave like copies, so assigning to them does not change the values the model actually uses.
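Here is a small check of the state_dict part, assuming a toy nn.Linear model and a made-up additive modification (both are just for illustration):

```python
import torch
import torch.nn as nn

m = nn.Linear(3, 2)
u = torch.zeros(2, 3, requires_grad=True)  # aux variable

sd = m.state_dict()
w0 = sd["weight"]
print(w0.requires_grad)   # tensors from state_dict() are detached by default

sd["weight"] = w0 + u     # w1 = f(w0, u), with a placeholder f
m.load_state_dict(sd)     # the values are copied in, the graph back to u is not kept

loss = m(torch.randn(4, 3)).sum()
# allow_unused=True so that autograd returns None instead of raising
(grad_u,) = torch.autograd.grad(loss, u, allow_unused=True)
print(grad_u)             # no gradient reaches u along this route
```

So the loss is computed with the modified values, but u is no longer part of the graph.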
any help would be appreciated, thanks in advance!