Hi a have a simple question about the memory usage of PyTorch models.
I have the following code:
print('1', process.memory_info().rss / 1e+9)
model = Task()
print('2', process.memory_info().rss / 1e+9)
model.to('cuda')
print('3', process.memory_info().rss / 1e+9)
This code has the following output:
1 0.268644352
2 0.269606912
3 2.501251072
Can somebody explain to me what is going on here? In print('2...
am I using the wrong function to check memory info? It seems to me like process.memory_info().rss
does not recognize the memory usage until the model weights are sent to my GPU.