Larger than GPU memory tensors

lmoss · July 13, 2019, 12:44pm

Hi all,

Is there an elegant way to apply a network to a tensor that is larger than GPU memory?
Tensorflow has tensorflow-mesh, maybe there’s something similar for pytorch?

I am aware of a sliding window approach, but that can lead to artifacts at edges of outputs.

Thanks!

smth · July 14, 2019, 4:10pm

have you looked at something like https://pytorch.org/docs/stable/checkpoint.html

lmoss · July 17, 2019, 9:46pm

I think checkpoint_sequential will do the trick, thanks!