Training loss plateau ? fix?

anantguptadbl · February 12, 2025, 5:46pm

@Sourabh_Yadav unfortunately there is no one solution that will solve your problem. You have to create creative and experiment a few things

Start with lr=0.01 and then decrease it by order of 10 gradually to understand whether the minima is being found
batch_size=2 is a big red flag. Your model is not getting enough data points to update weights in the back prop. You can start with a higher batch size and slowly decrease it once you have smoothened out the convergence

There are many more factors and you can find many articles and discussion threads for the same