in your example, what will happen to gradients of self.base? will they be calculated taking into account both input1 and input2?
in your example, what will happen to gradients of self.base? will they be calculated taking into account both input1 and input2?