Reference for scale_grad_by_freq option in nn.Embedding

I found the scale_grad_by_freq option in the nn.Embedding module.

However, I couldn’t find the reference for that option.

Does anyone know which survey or paper used this technique?

Thank you.