torch.qr and torch.ones being created on CPU instead of CUDA?

I was using torch.qr inside a module I wrote, with something like this:

self.q_transposed = torch.qr(nn.Parameter(torch.randn(self.m, self.n), requires_grad=False).T)[0].T

Even though the module's device is cuda (I made sure of that), this tensor ends up on CPU for some reason.

The same thing happens with torch.ones(): inside my forward method, I need to create a torch.ones() tensor and multiply it by another tensor (which is on the GPU), and for some reason this “ones” tensor also ends up on CPU.

The only solution I have found so far is to add a device argument to the init and then move each tensor back to cuda whenever my matmul fails. But I am afraid this would interfere with features such as nn.DataParallel, which will be useful for my application.

Hi,

Yes, all of the factory functions like torch.randn or torch.ones are independent of the nn.Module's device.
So in general, you want to pass a device= argument to these functions if you want the Tensor to be created on a specific device.
You can get the right device from any parameter or input of your Module.
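For example, a minimal sketch (the module name and shapes below are made up) of creating the ones tensor inside forward directly on the right device, taken from the input:

```python
import torch
import torch.nn as nn

class MyModule(nn.Module):
    def __init__(self, m, n):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(m, n))

    def forward(self, x):
        # Create the ones tensor on the same device as the input
        # (self.weight.device would work just as well).
        ones = torch.ones(x.shape[0], 1, device=x.device)
        return ones * x
```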

Also, if you want Tensor attributes to move along with the parameters, you should register them as “buffers” on the Module with self.register_buffer("q_transposed", the_tensor), which you can then access as self.q_transposed.
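A sketch of how the buffer version could look, assuming the same shapes as your snippet (the class name here is made up):

```python
import torch
import torch.nn as nn

class QRModule(nn.Module):
    def __init__(self, m, n):
        super().__init__()
        q, _ = torch.qr(torch.randn(m, n).T)
        # A buffer is not a parameter (no gradients), but it moves with
        # the module on .to("cuda"), .cuda(), and shows up in state_dict().
        self.register_buffer("q_transposed", q.T)

    def forward(self, x):
        return x @ self.q_transposed

# Moving the module moves the buffer too:
# mod = QRModule(8, 4).to("cuda")
# mod.q_transposed.device  -> cuda:0
```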

Side note: in your code sample, there is no need to wrap the Tensor in a Parameter here. Parameters only matter when you assign them to an nn.Module.

Right, thank you.
However, torch.qr does not have a device argument, right?

The output device will be the same as the input's. So you just need to make sure the randn in there is on the right device :slight_smile:
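For example (a small sketch, not from the thread):

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

a = torch.randn(5, 3, device=device)  # create the input on the target device
q, r = torch.qr(a)
print(q.device, r.device)             # both outputs follow a's device
```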
