Clip_grad_norm_ return nan for the first iterations

I am using norm = torch.nn.utils.clip_grad_norm_(parameters, clip_grad, norm_type=2) to clip the gradient. However, in the first three iterations, norm=nan . After that, it has numerical values. I don’t understand what the problem is.