Questions about overwriting variables in PyTorch

I’m confused about when overwriting values in PyTorch is or isn’t safe (link to docs).

I’d welcome any general advice, but here are a few examples I’m confused about:

loss += criterion(output, target)  # is overwriting loss acceptable?

or

loss = 0
for _ in range(5):
    output = model(input)  # Am I overwriting the output?
    loss += criterion(output, target)
loss.backward()

In the example above, I overwrite the Variable output multiple times in the for loop.

  1. Am I “overwriting values required to compute gradients” with this in-place operation?
  2. Does code like this make the autograd computational graph slower in some way?
  3. What if I was using a Variable with thousands of elements instead of a scalar?

Thank you!

  1. The line output = model(input) “overwrites” output only insofar as it rebinds the name output to new data, in this case the return value of model(input). If any other code keeps a reference to the old data (and the autograd graph does), Python keeps the old data alive. Here the next line loss += criterion(output, target) uses the current value of output to compute the loss, and PyTorch extends the computation graph it will later use to calculate gradients. Rebinding a name is not an in-place operation, so you are not overwriting values required to compute gradients; see the sketch after this list.

  2. Code like this shouldn’t make anything slower: output is just a name that points to data stored in memory, and rebinding a name is cheap.

  3. The size of the Variable makes no difference; the same rules apply to a scalar and to a tensor with thousands of elements.
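
For concreteness, here is a minimal sketch of the difference (the model, criterion, and shapes are invented for illustration, and it uses plain tensors, since Variable has since been merged into Tensor). Rebinding the name output is safe and backward() works; a genuine in-place modification of a tensor that autograd saved for backward raises a RuntimeError.

import torch
import torch.nn as nn

model = nn.Linear(4, 1)
criterion = nn.MSELoss()
input = torch.randn(8, 4)
target = torch.randn(8, 1)

# Rebinding the name each iteration: every old output tensor stays
# alive because the graph references it, so backward() can traverse
# all five iterations.
loss = 0
for _ in range(5):
    output = model(input)
    loss += criterion(output, target)
loss.backward()  # works fine

# A true in-place op is different: sigmoid saves its output for the
# backward pass, so mutating that output in place invalidates it.
x = torch.randn(3, requires_grad=True)
y = torch.sigmoid(x)
y.add_(1)  # in-place modification of a tensor saved for backward
# y.sum().backward()  # would raise: "... has been modified by an inplace operation"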
