Why did to() fail?

I defined a model and moved it to the GPU with model.to(DEVICE), but I always get the error below:

RuntimeError: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same

This is my code:

class BranchNet(nn.Module):
    def __init__(self, dropout=DROPOUT, num_classes=NUM_CLASSES):
        super(BranchNet, self).__init__()

DEVICE = torch.device("cuda:2")

model = BranchNet(dropout=dropout, num_classes=NUM_CLASSES)
model = model.to(DEVICE)
image = image.to(DEVICE)
label = label.to(DEVICE)


The error is most likely not in the .to() op itself. CUDA is asynchronous, so errors can point to the wrong line. Run with CUDA_LAUNCH_BLOCKING=1 to make sure the error points to the right line.


As far as I can see, this has nothing to do with the to() method. In general, PyTorch expects both the model and the input data to be of the same data type (float in our case), and if you are using a GPU, both the model weights and the input must be moved to it; otherwise the error you mention will occur. As a first step, check the architecture class.
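A quick way to verify this before calling forward is to compare the device of the model's parameters against the device of the input tensor. A minimal sketch (using a plain nn.Linear as a stand-in for BranchNet, and a random tensor as a stand-in for image):

```python
import torch
import torch.nn as nn

model = nn.Linear(4, 2)   # stand-in for BranchNet
x = torch.randn(1, 4)     # stand-in for the input image batch

# All registered parameters live on the same device, so checking
# the first one is enough.
param_device = next(model.parameters()).device
print(param_device, x.device, param_device == x.device)
```

If the two devices differ, one of the two .to(DEVICE) calls did not take effect (or the result was not assigned back, since tensor.to() is not in-place for tensors).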

Thank you!


I have already moved the sample and the model to the same device.

You likely did something wrong in your module code, e.g., did not properly register parts as submodules.
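The classic way this happens is storing layers in a plain Python list: those layers are invisible to model.parameters() and model.to(), so their weights stay on the CPU while the input is on the GPU. A minimal sketch (hypothetical module names, not the user's actual BranchNet):

```python
import torch
import torch.nn as nn

class BrokenNet(nn.Module):
    def __init__(self):
        super().__init__()
        # BUG: a plain list does NOT register the layers as submodules,
        # so .to(device) and .parameters() never see them.
        self.layers = [nn.Linear(4, 4), nn.Linear(4, 2)]

    def forward(self, x):
        for layer in self.layers:
            x = layer(x)
        return x

class FixedNet(nn.Module):
    def __init__(self):
        super().__init__()
        # nn.ModuleList registers each layer, so .to()/.cuda() reach them.
        self.layers = nn.ModuleList([nn.Linear(4, 4), nn.Linear(4, 2)])

    def forward(self, x):
        for layer in self.layers:
            x = layer(x)
        return x

# The broken model reports no parameters at all:
print(len(list(BrokenNet().parameters())))  # 0
print(len(list(FixedNet().parameters())))   # 4 (two weights + two biases)
```

If len(list(model.parameters())) is smaller than you expect, look for layers held in plain lists, dicts, or assigned outside __init__.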

@albanD CUDA is async, but out-of-kernel checks are not, since they are done without looking at the data contained in the tensors. So CUDA_LAUNCH_BLOCKING won't change things here.

Oh right, I read it too fast :confused:
Which line exactly causes the issue? That should help you figure out which module is to blame, as Simon said.