To my understanding, `tf.stop_gradient()` in TensorFlow treats its input as a constant during backpropagation. Passing `x.detach()` into an `nn` layer likewise prevents gradients from flowing back to `x`, so I believe the behavior is the same.
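A minimal sketch of what I mean (assuming a recent PyTorch; the tensor values are just illustrative):

```python
import torch

x = torch.tensor([1.0, 2.0], requires_grad=True)

# Gradients flow through the ordinary path:
y = (x * 2).sum()
y.backward()
grad_through = x.grad.clone()  # tensor([2., 2.])

# ...but not through the detached path: z is cut off from the graph,
# analogous to wrapping x in tf.stop_gradient() in TensorFlow.
z = (x.detach() * 2).sum()
print(z.requires_grad)  # False — calling z.backward() would raise an error
```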