How to save GPU memory?

I have tried many methods suggested on this forum, such as:

- setting torch.backends.cudnn.benchmark = True and torch.backends.cudnn.enabled = True,
- making my input Variables volatile with volatile=True (High GPU Memory Demand for pytorch?),
- and calling del loss, output at the end of each loop iteration (GPU memory consumption increases while training).

But none of them saved any GPU memory in my experiment!
I use DataParallel, and I have constructed a new network for my specific task. Could anyone help me with this? A rough sketch of what I tried is below.
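Here is a minimal sketch of my training loop with those changes applied (model, train_loader, criterion, and optimizer are placeholder names, not code from this thread, and it uses the pre-0.4 Variable API):

```python
import torch
from torch.autograd import Variable

torch.backends.cudnn.enabled = True
torch.backends.cudnn.benchmark = True  # let cuDNN pick the fastest conv algorithms

for data, target in train_loader:
    data = Variable(data.cuda())
    target = Variable(target.cuda())

    output = model(data)              # model is wrapped in nn.DataParallel
    loss = criterion(output, target)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    # drop references so the autograd graph can be freed sooner
    del output, loss

# Note: volatile=True disables graph construction entirely, so it only
# helps during inference passes, not inside a training loop:
#     data = Variable(data.cuda(), volatile=True)
```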

Going back to basics: have you simply tried reducing the mini-batch size?
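If the smaller batches hurt your gradient estimates, one common workaround (not mentioned in this thread; just a sketch using the same placeholder names as above) is to accumulate gradients over several small batches before each optimizer step:

```python
from torch.autograd import Variable

accum_steps = 4  # effective batch size = accum_steps * DataLoader batch size

optimizer.zero_grad()
for i, (data, target) in enumerate(train_loader):
    data = Variable(data.cuda())
    target = Variable(target.cuda())

    # scale the loss so the accumulated gradient matches a single big batch
    loss = criterion(model(data), target) / accum_steps
    loss.backward()  # gradients sum into .grad across iterations

    if (i + 1) % accum_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```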


I want to use other methods of saving GPU memory so that I can use a bigger mini-batch size.

If you’re fine-tuning a network, then we’re aware of a memory regression in 0.1.12. It’s already fixed in master, so you can either install from source or wait for the next release (it should happen this week).

Yes, I am fine-tuning a network!
Thank you very much!
I am looking forward to the new release!