A Variable with requires_grad=True saved in a dictionary is not being updated

@ptrblck I found another problem at L-273, and I changed the class following your suggestions:


```diff
@@ -66,6 +66,7 @@ class NeuralNetwork(nn.Module):
         """
         X = torch.tensor(X, requires_grad=False)
+        X = X.unsqueeze(0)
         X = self.linears[symbol](X)
         intercept_name = 'intercept_' + symbol
@@ -164,6 +165,7 @@ class NeuralNetwork(nn.Module):
         for key in self.state_dict():
             old_state_dict[key] = self.state_dict()[key].clone()
+        targets = [[target] for target in targets]
         targets = torch.tensor(targets, requires_grad=False)
         # Define optimizer
@@ -196,7 +198,7 @@ class NeuralNetwork(nn.Module):
                 outputs.append(image_energy)
-            outputs = torch.cat(outputs)
+            outputs = torch.stack(outputs)
             loss, rmse = self.get_loss(outputs, targets, data.atoms_per_image)
             _loss.append(loss)
             _rmse.append(rmse)
@@ -272,9 +274,10 @@ class NeuralNetwork(nn.Module):
         """
         self.optimizer.zero_grad()  # clear previous gradients
-
+        atoms_per_image = [[number] for number in atoms_per_image]
         atoms_per_image = torch.tensor(atoms_per_image, requires_grad=False,
                                        dtype=torch.float)
+
         outputs_atom = torch.div(outputs, atoms_per_image)
         targets_atom = torch.div(targets, atoms_per_image)
```
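As an aside on the `[[target] for target in targets]` change: wrapping each scalar by hand gives the tensor an explicit column dimension, and `torch.tensor(...).unsqueeze(1)` would do the same thing. A quick check:

```python
import torch

targets = [-14.5868730545, -14.5640010834]

# Wrapping each scalar by hand produces shape [2, 1]...
t1 = torch.tensor([[t] for t in targets])
# ...which is equivalent to adding the dimension afterwards.
t2 = torch.tensor(targets).unsqueeze(1)

print(torch.equal(t1, t2))  # True
```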

With those changes, the two predicted energies now differ, though only at the 6th decimal place:


```
outputs
tensor([[[-14.5754384995]],
        [[-14.5754394531]]], grad_fn=<StackBackward>)

targets
tensor([[-14.5868730545],
        [-14.5640010834]])
```
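The extra level of brackets in `outputs` comes from the switch to `torch.stack`: stacking adds a new leading dimension, while `torch.cat` concatenates along an existing one. A minimal sketch, assuming each `image_energy` has shape `[1, 1]` (which matches the nesting above):

```python
import torch

# Two fake per-image energies, each of shape [1, 1].
energies = [torch.full((1, 1), -14.57), torch.full((1, 1), -14.58)]

print(torch.cat(energies).shape)    # torch.Size([2, 1])    - matches targets
print(torch.stack(energies).shape)  # torch.Size([2, 1, 1]) - extra dimension
```

Note that subtracting a `[2, 1]` tensor from a `[2, 1, 1]` tensor broadcasts to `[2, 2, 1]`, so it may be worth squeezing the extra dimension before computing the loss.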

Applying MinMaxScaler from sklearn to the features gave the same result.
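For reference, the scaling itself is only a couple of lines; the `features` array below is a hypothetical stand-in for the real fingerprints:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler

features = np.random.rand(10, 4)  # hypothetical (n_images, n_features) matrix

scaler = MinMaxScaler()                  # maps each feature column to [0, 1]
scaled = scaler.fit_transform(features)  # fit on the training features only
```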

I finally solved the problem in this way:

  1. I modified forward() to receive the whole data object as an argument and operate over it to return the outputs tensor (a sketch of this follows the list).
  2. The features also had to be scaled, as @ptrblck suggested. Thank you for your help, I really appreciate it.
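This is roughly what point 1 looks like; it is only a sketch under my assumptions about the data layout (the `(features, symbol)` pairing and the layer sizes are hypothetical, the real code is in the gist):

```python
import torch
import torch.nn as nn

class NeuralNetwork(nn.Module):
    def __init__(self, symbols, n_features):
        super().__init__()
        # One linear layer per chemical symbol; nn.ModuleDict registers
        # them so that model.parameters() sees all of their weights.
        self.linears = nn.ModuleDict(
            {s: nn.Linear(n_features, 1) for s in symbols})

    def forward(self, data):
        outputs = []
        # `data` is assumed to yield one (features, symbol) pair per image.
        for features, symbol in data:
            X = torch.tensor(features, dtype=torch.float).unsqueeze(0)
            image_energy = self.linears[symbol](X).sum()
            outputs.append(image_energy)
        # One stacked tensor keeps the whole batch in a single graph.
        return torch.stack(outputs)
```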

The gist was updated :).


Awesome to hear!
Looking forward to hearing about your results in the near future! :wink:


One trick seems to work (at least the values are now being updated):

```python
# Register the tensor as a Parameter on the module, then keep a
# dictionary that references that same Parameter object.
self.value = nn.Parameter(value)
self.value_dict = {key: self.value}
```

With this, the optimizer doesn't need any special handling; passing model.parameters() is enough, because the nn.Parameter attribute is registered on the module and the dictionary entry just references that same object.
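To make the mechanism concrete, here is a self-contained demo (the 'intercept_H' key just mirrors the `intercept_ + symbol` naming from the diff above):

```python
import torch
import torch.nn as nn

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        # Assigning an nn.Parameter to an attribute registers it with the
        # module, so it appears in model.parameters(); the dictionary
        # entry is just an alias to the same object and sees every update.
        self.value = nn.Parameter(torch.tensor([1.0]))
        self.value_dict = {'intercept_H': self.value}

model = Model()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

loss = (model.value_dict['intercept_H'] ** 2).sum()
optimizer.zero_grad()
loss.backward()
optimizer.step()

print(model.value_dict['intercept_H'])  # no longer 1.0: the dict sees the update
```

For many keyed parameters, nn.ParameterDict does this registration for you and avoids the manual aliasing.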