by using torch.autograd.set_detect_anomaly(True) I found this error. My dataset seems ok so to resolve this issue should I use gradient clipping or just ignore ‘nan’ values using torch.isnan(x) ?
RuntimeError: Function 'SmoothL1LossBackward' returned nan values in its 0th output