nn layers and autograd.Function

Hi,

I was wondering if all the nn layers have a corresponding torch.autograd.Function associated with them (on the Python side)?

Best

Hi,

No, they don't. Even at the C++ level, some of them are implemented with a single elementary Function, but many of them are composed of several.
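
For illustration (this is my own example, not from the original posts), you can see which autograd nodes a layer produces by inspecting the `grad_fn` of its output. The exact node names (`AddmmBackward0`, `SoftmaxBackward0`, `NegBackward0`) depend on the PyTorch version, and the second case assumes `nn.Softmin` is currently implemented as a softmax of the negated input:

```python
import torch
import torch.nn as nn

x = torch.randn(4, 3, requires_grad=True)

# nn.Linear maps to a single fused backward node here:
y = nn.Linear(3, 2)(x)
print(type(y.grad_fn).__name__)  # e.g. AddmmBackward0

# nn.Softmin (assuming the current implementation) is a softmax of -x,
# so its backward graph contains more than one elementary node:
z = nn.Softmin(dim=1)(x)
print(type(z.grad_fn).__name__)                                   # e.g. SoftmaxBackward0
print([type(fn).__name__ for fn, _ in z.grad_fn.next_functions])  # e.g. ['NegBackward0']
```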


Thanks @albanD.

BTW, I have been trying to figure out how exactly PyTorch constructs the backward graph.
Does it append to a tree data structure while going through the forward pass?
Or does each tensor hold metadata about how it was produced and from which tensors (I'm guessing this could be grad_fn), and autograd backtracks from there until it reaches a source node?

Yes it’s a tree data structure :slight_smile:
You can check this graph using this package: https://github.com/szagoruyko/pytorchviz/
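
A small usage sketch of that package (assuming it is installed as `torchviz` and that graphviz is available on the system):

```python
import torch
import torch.nn as nn
from torchviz import make_dot

model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 1))
y = model(torch.randn(1, 8))

# make_dot walks grad_fn / next_functions starting from the output tensor
dot = make_dot(y, params=dict(model.named_parameters()))
dot.render("backward_graph", format="png")  # writes backward_graph.png
```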

Is it though? :slight_smile:

Because when I checked the code (https://github.com/szagoruyko/pytorchviz/blob/46add7f2c071b6d29fc3d56e9d2d21e1c0a3af1d/torchviz/dot.py#L56), it felt like there isn't a data structure per se, but rather each tensor keeps a list of functions or something like that? :thinking: Isn't that how the graph is built in the pytorchviz package?

It is an implicit tree :smiley: We don’t have a centralized structure for it.
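
To make the "implicit tree" concrete, here is a small sketch (my own example, not from the thread) that walks the graph by hand: each output tensor holds a `grad_fn` node, and each node points at its inputs' nodes via `next_functions`, with no centralized graph object anywhere:

```python
import torch

a = torch.randn(3, requires_grad=True)
b = torch.randn(3, requires_grad=True)
c = (a * b).sum()

def walk(fn, depth=0):
    """Recursively print the backward graph reachable from fn."""
    if fn is None:
        return
    print("  " * depth + type(fn).__name__)
    for next_fn, _ in fn.next_functions:
        walk(next_fn, depth + 1)

walk(c.grad_fn)
# SumBackward0
#   MulBackward0
#     AccumulateGrad   (leaf: a)
#     AccumulateGrad   (leaf: b)
```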

Exactly! Thanks! :slight_smile:

@albanD I have another follow-up question.

I was wondering if the forward activations are exposed at the nn.Module level?
I have seen that activations need to be explicitly saved to ctx when using autograd.Function, but I am wondering how PyTorch keeps the activations when someone is using the nn.Module API?

The autograd engine works below the nn.Module level.
So whether you use a regular Python function or an nn.Module, autograd does not see any difference and will save the required values properly.
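
For illustration, here is a minimal sketch (my own example) contrasting the two cases: with a custom autograd.Function you explicitly save on ctx whatever the backward needs, while with plain tensor ops (and therefore with nn.Modules, which are just Python code calling such ops) each recorded operation saves its own required values automatically:

```python
import torch

class MySquare(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)   # explicit: we decide what to keep for backward
        return x * x

    @staticmethod
    def backward(ctx, grad_out):
        (x,) = ctx.saved_tensors
        return 2 * x * grad_out

x = torch.randn(4, requires_grad=True)

# Custom Function: the saving is done by hand in forward().
y1 = MySquare.apply(x).sum()
y1.backward()

# Same computation as ordinary tensor ops: the recorded MulBackward0 node
# internally saves whatever its backward needs, without any user code.
x.grad = None
y2 = (x * x).sum()
y2.backward()
```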