I’m training a model composed of two fully connected layers with relus.
But I get a loss value of nan for each epoch (I’m using MSE loss).
I printed the weights and bias and I found some nan values.
Do you have any idea how can I resolve this problem?
Here is some examples of input signal I’m using:
Thanks in advance.
Could you check the input tensors for NaN or Inf values?
These values would create a NaN loss.
Also, how high is your learning rate?
Do you see the NaN loss right in the first batch?
I checked, the input tensors don’t have nan or inf values. It is composed of sparse signals.
I tried a learning rate between 1e-3 and 1e-5, but I get the same results.
Yes the nan loss begins from the first batch.
I’m using a training dataset of 5000 signals.
I noticed that when I decrease the number of signals to 50, I don’t get nan losses.
Are you normalizing the input data or what value range are you currently dealing with? Based on your information you might create an overflow somewhere.