Is there a Pytorch implementation of RAdam + LookAhead?

I was looking at the new paper by Geoffrey Hinton and was wondering if it’s been implemented. Look at the article:


Cross posted:

Here: https://github.com/mgrankin/over9000

RAdam+LA

from radam import RAdam
from optimizer import Lookahead
base_optim = RAdam(model.parameters(),lr = 0.001) 
optimizer =  Lookahead(base_optim, k=5, alpha=0.5)

RAdam | LAMB | Ralamb | Lookahead

2 Likes

The RAdam link seems broken (a 404 on github - maybe it’s not public?)

Anyway, this link works.