MiniBatch size by iteration

ptrblck · March 18, 2021, 9:32am

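Your code looks correct, but you might want to divide the accumulated loss by the number of accumulation steps. Otherwise, with a mean-reduced loss, the gradients are summed over the micro-batches instead of averaged over the effective batch. Also, here is a nice overview of different approaches in case you want to trade compute for memory.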
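To illustrate the scaling, here is a minimal sketch of gradient accumulation. It is not the code from the original question: the toy `nn.Linear` model, the random data, and `accumulation_steps = 4` are all assumptions made up for the example. It also assumes a mean-reduced loss and equally sized micro-batches, in which case the scaled, accumulated gradient should match the gradient of a single full-batch pass.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical toy setup; model, shapes, and hyperparameters are illustrative only.
model = nn.Linear(4, 1)
criterion = nn.MSELoss()  # default reduction="mean"
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

data = torch.randn(8, 4)
target = torch.randn(8, 1)
accumulation_steps = 4  # 4 micro-batches of 2 samples each

# Reference: gradient of the mean loss over the full batch of 8 samples.
criterion(model(data), target).backward()
full_batch_grad = model.weight.grad.clone()
optimizer.zero_grad()

# Gradient accumulation over equally sized micro-batches.
micro_batches = zip(data.chunk(accumulation_steps), target.chunk(accumulation_steps))
for step, (x, y) in enumerate(micro_batches, start=1):
    loss = criterion(model(x), y)
    # Scale each micro-batch loss so the summed gradients form an average.
    (loss / accumulation_steps).backward()
    if step % accumulation_steps == 0:
        accumulated_grad = model.weight.grad.clone()  # kept only for the check below
        optimizer.step()
        optimizer.zero_grad()

# Sanity check: both gradients should agree up to floating-point error.
print(torch.allclose(accumulated_grad, full_batch_grad, atol=1e-6))
```

Without the division, the accumulated gradient in this setup would be `accumulation_steps` times larger, which effectively scales the learning rate by the same factor.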