Parameter: weight decay- optimizer ADAM

The weight_decay parameter adds a L2 penalty to the cost which can effectively lead to to smaller model weights.