Code works but the model doesn't train

mrswjung · October 27, 2021, 5:33am

I trained a model

the code works but the model loss doesn’t get better.

the training loss looks like this.

What is usually the reason when this type of problem occurs??

my3bikaht · October 27, 2021, 5:56am

With loss this big you need extremely small learning rate, otherwise your weights will change at crazy amounts. Seems like you are adding loss across samples/batches instead of averaging it.