Is the Nesterov accelerated gradient (NAG) implementation in torch.optim.SGD correct?

Hi everybody,

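I'm trying to understand how the Nesterov momentum update at https://github.com/pytorch/pytorch/blob/master/torch/optim/sgd.py#L95 is equivalent to the one described in http://www.cs.toronto.edu/~fritz/absps/momentum.pdf. The two update rules look different to me. Could anyone explain how they match up?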
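To make the question concrete, here is a minimal sketch of the two update rules as I read them, in plain Python on a toy 1-D quadratic. The function names, the objective, and the hyperparameter values are my own assumptions for illustration, not code from either source. It also assumes zero dampening, no weight decay, and a constant learning rate. `paper_nag` follows the paper's form (v ← μv − ε∇f(θ + μv), θ ← θ + v), and `pytorch_style_nag` follows my reading of the linked `sgd.py` (buf ← μ·buf + g, p ← p − lr·(g + μ·buf)).

```python
def grad(x, a=3.0):
    # Gradient of the toy objective f(x) = 0.5 * a * x**2 (chosen only for illustration).
    return a * x


def paper_nag(theta, steps, lr=0.1, mu=0.9):
    """Paper-style NAG: the gradient is evaluated at the look-ahead point theta + mu * v."""
    v = 0.0
    lookaheads = []
    for _ in range(steps):
        lookaheads.append(theta + mu * v)
        v = mu * v - lr * grad(theta + mu * v)
        theta = theta + v
    lookaheads.append(theta + mu * v)
    return theta, lookaheads


def pytorch_style_nag(p, steps, lr=0.1, mu=0.9):
    """My reading of the SGD nesterov branch: the gradient is evaluated at the stored parameter p."""
    buf = 0.0
    params = []
    for _ in range(steps):
        params.append(p)
        g = grad(p)
        buf = mu * buf + g           # momentum buffer accumulates raw gradients
        p = p - lr * (g + mu * buf)  # Nesterov-style step using the updated buffer
    params.append(p)
    return p, params


if __name__ == "__main__":
    theta, lookaheads = paper_nag(1.0, 20)
    p, params = pytorch_style_nag(1.0, 20)
    # Largest gap between PyTorch-style p and the paper's look-ahead point.
    print(max(abs(a - b) for a, b in zip(lookaheads, params)))
    # Gap between PyTorch-style p and the paper's theta itself.
    print(abs(theta - p))
```

In this sketch the PyTorch-style parameter `p` tracks the paper's look-ahead point θ + μv rather than θ itself, with the buffer playing the role of −v/lr. If that reading is right, the two rules would be the same iteration under a change of variables, but I would appreciate confirmation that this is the intended interpretation.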

Thanks for your help!