Dear all, we want to train a new “resnet50” from scratch in PyTorch. Could anyone give some references on how to set the learning rate and weight decay, and whether to use a step or poly schedule? Thanks~
The hyper-parameters most likely depend on your use case and dataset.
However, I think the ImageNet example will give you good default values.
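To make that concrete, here is a rough sketch of the two schedules you asked about, written in plain Python so you can see the numbers. The constants are the defaults of the official PyTorch ImageNet example (pytorch/examples, imagenet/main.py) as far as I remember them: SGD with lr 0.1, momentum 0.9, weight decay 1e-4, 90 epochs, batch size 256, and the lr divided by 10 every 30 epochs. Please double-check them against the script itself. The poly variant and its power of 0.9 are my own assumption (a value often used for segmentation models), not something that example uses, and the function names `step_lr` and `poly_lr` are made up for illustration.

```python
# Defaults as I recall them from the official PyTorch ImageNet example
# (assumption: verify against the script before relying on them).
BASE_LR = 0.1
MOMENTUM = 0.9
WEIGHT_DECAY = 1e-4
EPOCHS = 90


def step_lr(epoch, base_lr=BASE_LR, step_size=30, gamma=0.1):
    """Step schedule: multiply the lr by `gamma` every `step_size` epochs."""
    return base_lr * gamma ** (epoch // step_size)


def poly_lr(epoch, base_lr=BASE_LR, total_epochs=EPOCHS, power=0.9):
    """Poly schedule: decay smoothly from base_lr to 0 over total_epochs.

    power=0.9 is an assumed value, not taken from the ImageNet example.
    """
    return base_lr * (1 - epoch / total_epochs) ** power


if __name__ == "__main__":
    # Print both schedules at a few epochs to compare their shapes.
    for epoch in (0, 29, 30, 60, 89):
        print(f"epoch {epoch:2d}  step={step_lr(epoch):.5f}  poly={poly_lr(epoch):.5f}")
```

In real training code the step schedule corresponds to `torch.optim.lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)` on top of `torch.optim.SGD`. I would start with step since that is what the example uses, and only try poly if you have a reason to.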