You could accumulate the gradients over a few samples and call the optimizer only once after these steps.
This post might help.
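
In case it is useful, here is a rough sketch of the idea in plain Python, with no framework so that it runs anywhere. Everything in it is made up for illustration: the toy model `y = w * x`, the squared-error loss, and the names `grad`, `train_accumulated`, and `accum_steps`. In a framework such as PyTorch, the same pattern would roughly correspond to calling `backward()` for each sample and `optimizer.step()` followed by `optimizer.zero_grad()` only every `accum_steps` samples.

```python
# Sketch of gradient accumulation on a toy 1-D model y = w * x.
# All names and values are hypothetical, chosen for illustration only.

def grad(w, x, y):
    # Gradient of the per-sample loss 0.5 * (w * x - y) ** 2 with respect to w.
    return (w * x - y) * x


def train_accumulated(samples, w=0.0, lr=0.1, accum_steps=4):
    acc = 0.0  # running sum of per-sample gradients
    for i, (x, y) in enumerate(samples, start=1):
        acc += grad(w, x, y)  # accumulate, but do not update w yet
        if i % accum_steps == 0:
            # One "optimizer step" using the averaged accumulated gradient.
            w -= lr * acc / accum_steps
            acc = 0.0  # reset the accumulator, like zeroing the gradients
    # Note: gradients from a trailing partial group are dropped in this sketch.
    return w


# Data generated from y = 2 * x, so w should approach 2.
samples = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)] * 50
w_final = train_accumulated(samples)
print(w_final)
```

Dividing the accumulated sum by `accum_steps` makes each update match what a single batch of that size would give with a mean-reduced loss. If you would rather sum the gradients, drop the division and lower the learning rate accordingly.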