How to profile CUDA memory usage for each part of a model

Referring to the PyTorch profiler, it seems to only trace CPU memory rather than GPU memory. Is there any tool to trace CUDA memory usage for each part of a model?

Try Stonesjtu/pytorch_memlab on GitHub (profiling and inspecting memory in PyTorch), though it may be easier to just manually wrap some code blocks and measure usage deltas (of torch.cuda.memory_allocated).
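A minimal sketch of the manual-wrapping approach: `measure_cuda_delta` is a hypothetical helper (not part of PyTorch or pytorch_memlab) that compares `torch.cuda.memory_allocated` before and after running a code block.

```python
import torch

def measure_cuda_delta(fn, device="cuda"):
    """Run fn() and print how much CUDA memory it left allocated.

    Hypothetical helper: measures the delta of
    torch.cuda.memory_allocated around a code block.
    """
    torch.cuda.synchronize(device)
    before = torch.cuda.memory_allocated(device)
    result = fn()
    torch.cuda.synchronize(device)
    after = torch.cuda.memory_allocated(device)
    print(f"allocated delta: {(after - before) / 1024**2:.2f} MiB")
    return result

if torch.cuda.is_available():
    # e.g. measure the persistent allocation left by one block
    x = measure_cuda_delta(lambda: torch.randn(1024, 1024, device="cuda"))
```

Note this only captures memory still allocated when the block returns; temporary peaks inside the block would need `torch.cuda.max_memory_allocated` plus `torch.cuda.reset_peak_memory_stats`.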

Thanks for your reply, I'll try it.
Is there an official PyTorch profiler for GPU memory?

afaik, it only has torch.profiler.profile(profile_memory=True) as an aggregator, I'm not sure if that produces useful results (there is the undocumented autograd.profiler.record_function("X") to mark code blocks)…
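To illustrate how the two combine, here is a small CPU-only sketch: `record_function` labels a region, and the profiler's aggregated table then shows per-label memory columns. The model and shapes are just placeholders.

```python
import torch
from torch.profiler import profile, record_function

model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.ReLU(),
    torch.nn.Linear(256, 10),
)
x = torch.randn(32, 128)

with profile(profile_memory=True) as prof:
    # label this block so it appears as a named row in the table
    with record_function("forward_pass"):
        y = model(x)

# aggregated stats, sorted by self CPU memory usage
print(prof.key_averages().table(sort_by="self_cpu_memory_usage", row_limit=5))
```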

Thanks! torch.profiler.profile(profile_memory=True) seems to only report CPU memory usage; I might have to find another way.

there are options for CUDA (version-dependent, so check the docs)
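For example (assuming a PyTorch version with `torch.profiler`, 1.8.1 or later), passing `ProfilerActivity.CUDA` alongside `profile_memory=True` records device-side allocations, which then show up in the CUDA memory columns of the summary table:

```python
import torch
from torch.profiler import profile, ProfilerActivity

if torch.cuda.is_available():
    model = torch.nn.Linear(512, 512).cuda()
    x = torch.randn(64, 512, device="cuda")

    with profile(
        activities=[ProfilerActivity.CPU, ProfilerActivity.CUDA],
        profile_memory=True,
    ) as prof:
        y = model(x)

    # CUDA allocations appear in the "CUDA Mem" / "Self CUDA Mem" columns
    print(prof.key_averages().table(
        sort_by="self_cuda_memory_usage", row_limit=5))
```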