I’ve noticed that optim.optimizer has a variable named ‘param_group’, and it seems can divide a network into many groups and set different learning rate and optimization method for them for better training. But how can I divide a network into those groups? Original network seems is included by just one group. Thank you!
Have a look at the per-parameter option doc.