Is PyTorch autograd tape-based?

LapoFrati · February 24, 2018, 4:55pm

The documentation (and many other places online) states that autograd is tape-based:

but Paszke, Adam, et al., “Automatic differentiation in PyTorch” (2017), clearly states:

So I guess it’s not?

tom · February 24, 2018, 7:20pm

I think this means that PyTorch and Chainer are smarter than “add everything to one giant global list”: they store a graph of operations and distinguish between the paths they need to backprop through and the work they don’t. For example, if you are fine-tuning a convnet by replacing just the final layer, only the part of the graph from that final layer to the loss is saved and backpropagated through.
You can also use this “selective recording” (my unscientific term) for neat things like differentiating implicit functions and trading compute for memory.
Quite likely, the nuances of “no tape” vs. an advanced, modern interpretation of “tape” are as opaque to laypersons like me as they are clear to experts in the field, but one would expect that by 2017 people knew a trick or two about data structures, thread safety, and so on beyond what Wengert had in 1964.
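
The fine-tuning case can be sketched roughly as follows. This is only an illustration, not how any particular model is set up: `backbone` and `head` are hypothetical stand-ins for a pretrained convnet and its replaced final layer, and the layer sizes are arbitrary. The point is that nothing is recorded for the frozen part, so its output has no `grad_fn`.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-in for a pretrained feature extractor (not a real convnet).
backbone = nn.Sequential(nn.Linear(8, 16), nn.ReLU())
for p in backbone.parameters():
    p.requires_grad_(False)  # frozen: autograd records no graph for these ops

# Hypothetical replacement for the final layer; its parameters stay trainable.
head = nn.Linear(16, 2)

x = torch.randn(4, 8)
features = backbone(x)   # no input here requires grad, so nothing is recorded
logits = head(features)  # recorded, because head's parameters require grad
loss = logits.sum()

print(features.requires_grad, features.grad_fn)  # False None
print(logits.grad_fn is not None)                # True

loss.backward()
backbone_grads = [p.grad for p in backbone.parameters()]
head_grads = [p.grad for p in head.parameters()]
```

After `backward()`, every entry of `backbone_grads` is still `None`, while every entry of `head_grads` holds a tensor.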

Best regards

Thomas

SimonW · February 24, 2018, 7:26pm

In PyTorch there is no tape in the traditional sense. In the engine, we queue up a backward job as soon as all of its dependencies are satisfied. So it is not reversing a recorded sequence of operations, but it still executes in a topologically sorted order. This makes it easy to run these tasks on multiple threads (as long as they don’t conflict with one another).
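
To make the “queue a job once its dependencies are satisfied” idea concrete, here is a small Python sketch that walks the graph exposed through `grad_fn.next_functions` and releases each node only after every node that feeds gradient into it has run. It is an illustration under simplifying assumptions, not the engine’s actual code: the real engine is written in C++ and uses worker threads, and `backward_ready_order` is a hypothetical helper name.

```python
from collections import deque

import torch


def backward_ready_order(root):
    """Order in which a dependency-counting scheduler would release grad_fn nodes."""
    # Pass 1: for each node, count the edges that deliver gradient into it.
    deps = {}
    seen = {root}
    stack = [root]
    while stack:
        node = stack.pop()
        for nxt, _ in node.next_functions:
            if nxt is None:  # this input does not require grad
                continue
            deps[nxt] = deps.get(nxt, 0) + 1
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)

    # Pass 2: a node becomes ready once all of its incoming edges are done.
    ready = deque([root])
    order = []
    while ready:
        node = ready.popleft()
        order.append(node)
        for nxt, _ in node.next_functions:
            if nxt is None:
                continue
            deps[nxt] -= 1
            if deps[nxt] == 0:
                ready.append(nxt)
    return order


a = torch.tensor(2.0, requires_grad=True)
b = a * 3
c = b + 1    # first consumer of b
d = b * b    # second consumer of b (uses it twice)
out = c + d

order = backward_ready_order(out.grad_fn)
print([type(node).__name__ for node in order])
```

In this sketch the node for `b` is released only after the nodes for both `c` and `d` have been processed, whichever of the two happens to run first. Two nodes that are ready at the same time have no ordering constraint between them, which is what leaves room for running them in parallel.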

sakaia · July 24, 2018, 11:51am

Is the following description in README.md appropriate?

a tape-based automatic differentiation library that supports all differentiable Tensor operations in torch

github.com

pytorch/pytorch/blob/v0.4.0/README.md

PyTorch is a Python package that provides two high-level features:
- Tensor computation (like NumPy) with strong GPU acceleration
- Deep neural networks built on a tape-based autograd system