I'm training a DAE (denoising autoencoder) with an adaptive learning rate using a time-decay factor of 0.99, plus Nesterov's accelerated gradient. Is there a direct PyTorch optimizer option, or a workaround, that lets me do this?
What do you mean, torch.optim.SGD supports Nesterov momentum already?
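A minimal sketch of how the two pieces combine: `torch.optim.SGD` does take `nesterov=True` (it requires a nonzero `momentum` and zero `dampening`), and the 0.99 time decay can be layered on with `torch.optim.lr_scheduler.ExponentialLR`. The tiny autoencoder, layer sizes, noise level, and learning rate below are illustrative assumptions, not anything from the original question.

```python
import torch
import torch.nn as nn

# Stand-in for a DAE: sizes here are arbitrary placeholders.
model = nn.Sequential(nn.Linear(8, 4), nn.ReLU(), nn.Linear(4, 8))

# nesterov=True needs momentum > 0 and dampening == 0.
optimizer = torch.optim.SGD(model.parameters(), lr=0.1,
                            momentum=0.9, nesterov=True)

# Multiplies the learning rate by gamma after every scheduler.step().
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.99)

x = torch.randn(16, 8)
noisy = x + 0.1 * torch.randn_like(x)   # denoising-style corrupted input
loss_fn = nn.MSELoss()

for epoch in range(3):
    optimizer.zero_grad()
    loss = loss_fn(model(noisy), x)     # reconstruct clean x from noisy input
    loss.backward()
    optimizer.step()
    scheduler.step()                    # apply the 0.99 time decay once per epoch

print(optimizer.param_groups[0]["lr"])  # lr after three decay steps
```

Calling `scheduler.step()` once per epoch gives the per-epoch decay `lr = 0.1 * 0.99**epoch`; call it per batch instead if the decay is meant to be per iteration.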