Why not use detach when freezing?

def set_parameter_requires_grad(model, feature_extracting):
    # When feature extracting, freeze every parameter so autograd
    # skips them entirely during the backward pass.
    if feature_extracting:
        for param in model.parameters():
            param.requires_grad = False
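
For context, a minimal usage sketch; the torchvision resnet18 backbone and the 10-class head are assumptions for illustration, not part of the original post:

import torch.nn as nn
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
set_parameter_requires_grad(model, feature_extracting=True)  # freeze the backbone

# A newly created head defaults to requires_grad=True, so only it will train
model.fc = nn.Linear(model.fc.in_features, 10)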

Why use only requires_grad = False?

  1. Detach requires hard-coding inside the model's forward pass.
  2. Detach requires knowledge of how the layers are connected.
    Setting requires_grad = False, on the other hand, is external code and requires no knowledge of the model graph.
    You could also simply not pass the frozen layers to the optimizer, but that has drawbacks too (gradients keep accumulating in their .grad buffers); see the sketch below.
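
A minimal sketch of the optimizer side, assuming model is the frozen backbone from above: after setting requires_grad = False, pass only the trainable parameters to the optimizer, so no .grad buffers are ever computed or accumulated for the frozen ones.

import torch

# Only parameters with requires_grad=True reach the optimizer
optimizer = torch.optim.SGD(
    (p for p in model.parameters() if p.requires_grad),
    lr=1e-3,
)

# By contrast, merely omitting parameters that still have requires_grad=True
# from the optimizer does not stop autograd from computing and accumulating
# .grad for them on every backward pass.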

Does detach automatically set requires_grad = False?

detach() does not change requires_grad on the original tensor. It is a tensor method that returns a new tensor cut off from the computation graph; the returned tensor itself has requires_grad = False, while the original is left unchanged.
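
A quick demonstration (the tensor names are just for illustration):

import torch

x = torch.ones(3, requires_grad=True)
y = x.detach()  # new tensor sharing storage, cut off from the graph

print(x.requires_grad)  # True  -- the original tensor is unchanged
print(y.requires_grad)  # False -- the detached result tracks no gradients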


Thank you very much!