I’ve written code that adds hidden neurons to each hidden layer during training. I’m using the Adam optimizer, which keeps per-parameter state (the moving averages exp_avg and exp_avg_sq) that effectively gives each parameter its own learning rate. At the moment I create a new optimizer over the new model parameters whenever nodes are added to the hidden layers, which discards that per-parameter state for all the parameters that existed before the neurons were added.
How do I keep the same per-parameter learning rates? Is the correct way to do this to add rows and columns of zeros to the parameters, exp_avg, and exp_avg_sq, or is there an easier way?
There are no new modules: the neurons are added to existing layers, not as new layers. When I print the network parameters, they all show up in the same parameter group.
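For what it's worth, here is a minimal sketch of the zero-padding approach described above, assuming a single hidden layer sandwiched between two nn.Linear modules. The helper names (grow_hidden, pad_state) are hypothetical, not part of PyTorch: the old weights are copied into larger layers, a fresh Adam optimizer is built, and the old exp_avg / exp_avg_sq tensors are zero-padded to the new shapes and installed in the new optimizer's state dict before the next step.

```python
import torch
import torch.nn as nn

def pad_state(t, new_shape):
    """Zero-pad an Adam state tensor so it matches a grown parameter's shape.
    Old values keep their positions; new rows/columns start at zero."""
    out = torch.zeros(new_shape, dtype=t.dtype, device=t.device)
    out[tuple(slice(0, s) for s in t.shape)] = t
    return out

def grow_hidden(fc1, fc2, optimizer, n_new):
    """Add n_new neurons to the hidden layer between fc1 and fc2 (hypothetical
    helper). Returns the new layers and a rebuilt Adam optimizer that keeps
    the per-parameter state of the pre-existing weights."""
    h = fc1.out_features
    new_fc1 = nn.Linear(fc1.in_features, h + n_new)
    new_fc2 = nn.Linear(h + n_new, fc2.out_features)
    with torch.no_grad():
        # fc1 gains output rows, fc2 gains input columns; copy old weights in.
        new_fc1.weight[:h] = fc1.weight
        new_fc1.bias[:h] = fc1.bias
        new_fc2.weight[:, :h] = fc2.weight
        new_fc2.bias.copy_(fc2.bias)

    new_opt = torch.optim.Adam(
        list(new_fc1.parameters()) + list(new_fc2.parameters()),
        lr=optimizer.param_groups[0]["lr"])

    # Transfer Adam state: map each old parameter to its grown counterpart.
    pairs = [(fc1.weight, new_fc1.weight), (fc1.bias, new_fc1.bias),
             (fc2.weight, new_fc2.weight), (fc2.bias, new_fc2.bias)]
    for old_p, new_p in pairs:
        old_state = optimizer.state.get(old_p)
        if not old_state:
            continue  # parameter was never stepped; leave state lazy-initialized
        new_opt.state[new_p] = {
            "step": old_state["step"],
            "exp_avg": pad_state(old_state["exp_avg"], new_p.shape),
            "exp_avg_sq": pad_state(old_state["exp_avg_sq"], new_p.shape),
        }
    return new_fc1, new_fc2, new_opt
```

The zeros are the same values a fresh optimizer would start from, so the new neurons behave like newly initialized parameters while the old ones continue with their accumulated moments; copying "step" keeps Adam's bias correction consistent for the old entries (it is slightly off for the new zero entries, which is usually acceptable since they have no history anyway).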