torch.nn.utils.clip_grad.clip_grad_norm_ is too slow

The TensorFlow API tf.clip_by_global_norm is similar to torch.nn.utils.clip_grad.clip_grad_norm_, but it doesn't slow training down.
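
For anyone comparing the two, here is a rough sketch of the operation both functions are understood to perform: global-norm clipping, where one L2 norm is computed over all gradients together and every gradient is scaled by the same factor. This is plain Python over lists of floats, not either library's actual implementation. The helper name `clip_by_global_norm` and the `eps` default are assumptions made for illustration only.

```python
import math

def clip_by_global_norm(grads, max_norm, eps=1e-6):
    """Hypothetical helper: scale all gradients by one shared factor
    so that their joint L2 norm does not exceed max_norm."""
    # One norm over every element of every gradient (the "global" norm).
    total_norm = math.sqrt(sum(g * g for grad in grads for g in grad))
    # Shared scale factor, capped at 1.0 so small gradients are left unchanged.
    scale = min(1.0, max_norm / (total_norm + eps))
    clipped = [[g * scale for g in grad] for grad in grads]
    return clipped, total_norm

# Two "parameters" whose gradients have a joint norm of 5.0.
grads = [[3.0, 0.0], [0.0, 4.0]]
clipped, total_norm = clip_by_global_norm(grads, max_norm=1.0)
```

Since the math is the same on both sides, any speed difference would have to come from how the norm is computed and applied on the device rather than from the operation itself, so profiling the clipping call in isolation is probably the first thing to try.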