As I understand it, this subtracts (learning rate * gradient) from the coefficients…
But why is it necessary to subtract?
Thanks
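
For what it's worth, a tiny sketch may help with the "why subtract" part. The gradient points in the direction in which the loss increases, so stepping against it (subtracting) moves the coefficient toward lower loss, while adding would move it toward higher loss. Everything below is made up for illustration and is not from the original post: the one-coefficient loss (w - 3)^2, the starting value 0.0, the learning rate 0.1, and the helper names `loss`, `grad`, and `run`.

```python
# Hypothetical one-coefficient example: loss(w) = (w - 3)^2, minimized at w = 3.
def loss(w):
    return (w - 3.0) ** 2


def grad(w):
    # Derivative of the loss with respect to w.
    return 2.0 * (w - 3.0)


def run(sign, w=0.0, learning_rate=0.1, steps=20):
    # sign = -1.0 is the usual update (subtract learning_rate * gradient);
    # sign = +1.0 adds it instead, purely for comparison.
    for _ in range(steps):
        w = w + sign * learning_rate * grad(w)
    return w


w_subtract = run(sign=-1.0)  # moves toward the minimum at w = 3
w_add = run(sign=+1.0)       # moves away from the minimum

print("subtracting:", w_subtract, "loss:", loss(w_subtract))
print("adding:     ", w_add, "loss:", loss(w_add))
```

With these assumed values, subtracting shrinks the distance to the minimum on every step, while adding grows it, which is why the update uses a minus sign.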