Half Precision BackProp Taking up More RAM than Float

Henry_Chinner · June 8, 2020, 11:17am

I have a model that takes approx 8gb RAM. When I convert the model to half via model.half() the RAM blows up to 48GB during backward().

Has anyone encountered a similar problem?

albanD · June 8, 2020, 4:44pm

Hi,

Do you run this on CPU?
I don’t think the CPU actually has half precision implementation and so it falls back to single precision float. This might explain the memory increase

Henry_Chinner · June 8, 2020, 5:32pm

Hi,

It is running on GPU Tesla V100. It is running fine as float. Get’s in just under 16GB. But when I switch to half it blows up during the backward pass. The memory goes up to 48GB then.

albanD · June 8, 2020, 8:19pm

The CPU memory or the GPU memory? V100 don’t have 48GB of memory right?

Henry_Chinner · June 9, 2020, 5:49am

GPU memory. I get an errror message saying the GPU tried to allocate 48GB RAM and then crashes as it only has 16GB.

ptrblck · June 9, 2020, 7:24am

Could you post your model definition here so that we could reproduce this behavior, please?