Why did my diffusion network, with a file size of only 7 MB, exhaust the video memory of an A100?

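The forward activations most likely account for most of the memory, as explained here. The file size reflects only the weights. Activations scale with batch size and input resolution rather than with the parameter count, and during training they are kept for the backward pass.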
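To make the gap concrete, here is a rough back-of-the-envelope sketch comparing the weight memory of a single convolution layer with the memory of its output activation. The layer shape, batch size, and resolution are illustrative assumptions, not taken from your network, and the helper names are hypothetical. It assumes float32 (4 bytes per element) and ignores optimizer state, gradients, and framework overhead.

```python
BYTES_PER_FLOAT32 = 4  # assumption: everything is stored as float32


def conv_param_bytes(in_channels, out_channels, kernel_size):
    """Bytes used by the weights and biases of one square 2D convolution."""
    weights = in_channels * out_channels * kernel_size * kernel_size
    biases = out_channels
    return (weights + biases) * BYTES_PER_FLOAT32


def activation_bytes(batch, channels, height, width):
    """Bytes used by one output feature map of shape (batch, channels, height, width)."""
    return batch * channels * height * width * BYTES_PER_FLOAT32


# Hypothetical layer: 64 -> 64 channels, 3x3 kernel,
# applied to a batch of 16 feature maps at 256x256 resolution.
param_mem = conv_param_bytes(64, 64, 3)
act_mem = activation_bytes(16, 64, 256, 256)

print(f"weights:    {param_mem / 2**20:.2f} MiB")
print(f"activation: {act_mem / 2**20:.2f} MiB")
print(f"ratio:      {act_mem / param_mem:.0f}x")
```

Under these assumptions the single activation tensor is well over a thousand times larger than the layer's weights, and a network keeps one such tensor (or more) per layer for the backward pass. Reducing batch size or resolution, using mixed precision, or enabling gradient checkpointing are the usual ways to bring this down.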