Debugging CUDA out of memory (16GB GPU, 14+ GB reserved)

Besides the parameters and inputs intermediate forward activations could allocate a lot of memory as explained in e.g. this post.

1 Like