Mike2004
(Mike Long)
#1
Hello, can someone explain in more detail what the weight_decay parameter of the Adam optimizer does?
Thank you.
The weight_decay parameter adds an L2 penalty on the weights to the cost, which in effect pushes the optimizer toward smaller model weights.
https://dejanbatanjac.github.io/2019/07/02/Impact-of-WD.html
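To make that concrete, here is a rough sketch in plain Python of one way to picture it, not the actual PyTorch source. It assumes the L2-style form, where weight_decay * w is added to the gradient before the usual Adam update. The toy loss (w - 3)^2, the helper names adam_step and train, and all hyperparameter values are made up for illustration. In PyTorch itself you would just pass the value when building the optimizer, e.g. torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4).

```python
import math

def adam_step(w, grad, state, lr=0.05, beta1=0.9, beta2=0.999,
              eps=1e-8, weight_decay=0.0):
    """One Adam update for a single scalar weight (illustrative only)."""
    # L2 penalty: the gradient of (weight_decay / 2) * w**2 is weight_decay * w,
    # so it is simply added to the loss gradient.
    grad = grad + weight_decay * w
    state["t"] += 1
    # Running averages of the gradient and the squared gradient.
    state["m"] = beta1 * state["m"] + (1 - beta1) * grad
    state["v"] = beta2 * state["v"] + (1 - beta2) * grad * grad
    # Bias correction for the zero-initialised averages.
    m_hat = state["m"] / (1 - beta1 ** state["t"])
    v_hat = state["v"] / (1 - beta2 ** state["t"])
    return w - lr * m_hat / (math.sqrt(v_hat) + eps)

def train(weight_decay, steps=500):
    """Minimise the toy loss (w - 3)**2 starting from w = 0."""
    w = 0.0
    state = {"t": 0, "m": 0.0, "v": 0.0}
    for _ in range(steps):
        grad = 2.0 * (w - 3.0)  # gradient of the toy loss
        w = adam_step(w, grad, state, weight_decay=weight_decay)
    return w

w_plain = train(weight_decay=0.0)  # no penalty: settles near the loss minimum
w_decay = train(weight_decay=1.0)  # with penalty: pulled toward zero
print(w_plain, w_decay)
```

Without the penalty the weight ends up close to the minimum of the toy loss; with the penalty it is pulled toward zero and ends up smaller, which is the "smaller model weights" effect described above.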
Mike2004
(Mike Long)
#3
Thank you for your time, now I understand!