How to implement accumulated gradient?

Hi smth,

So, eventually, there is no necessity to divide loss with iter_size?
I’m still bit confusing since apaszke mentioned about dividing here.

Thank you for your help.

1 Like