Proper way to do gradient clipping?

Brando_Miranda · August 23, 2017, 4:15pm

For people just trying to get an answer quickly, call this after loss.backward() and before optimizer.step():

torch.nn.utils.clip_grad_norm_(mdl_sgd.parameters(), clip)

or, to clip each gradient element by value instead of by norm, with an in-place clamp:

W.grad.data.clamp_(-clip, clip)
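
The two options are not equivalent, so here is a rough sketch of what each one computes. It is written in plain Python rather than PyTorch so the arithmetic is visible. The helper names clip_by_norm and clip_by_value are made up for illustration and are not part of torch, and the eps term is an assumption about how the division is guarded.

```python
import math

def clip_by_norm(grads, max_norm, eps=1e-6):
    """Rescale all gradients together so their combined L2 norm is at most max_norm."""
    # One norm over every gradient entry of every parameter.
    total_norm = math.sqrt(sum(g * g for vec in grads for g in vec))
    # Shrink only when the norm exceeds the threshold; never scale up.
    scale = min(1.0, max_norm / (total_norm + eps))
    return [[g * scale for g in vec] for vec in grads], total_norm

def clip_by_value(grads, clip):
    """Clamp every gradient entry independently into [-clip, clip]."""
    return [[max(-clip, min(clip, g)) for g in vec] for vec in grads]

# Two hypothetical parameters whose gradients have a combined L2 norm of 5.
grads = [[3.0, 0.0], [0.0, 4.0]]

by_norm, total_norm = clip_by_norm(grads, max_norm=1.0)
by_value = clip_by_value(grads, clip=1.0)

print(total_norm)  # norm before clipping
print(by_norm)     # same direction as grads, shorter length
print(by_value)    # each entry clamped, so the direction can change
```

Norm clipping keeps the 3:4 ratio between the two nonzero entries and only shortens the overall gradient, while the clamp flattens both entries to the same value. Which one is preferable depends on the model, so treat this as an illustration of the difference rather than a recommendation.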

Also, a similar question: