Loss.backward() error when using M1 GPU (mps device)

I want to train a Seq2Seq model on the M1 GPU. The code runs fine on the CPU. However, if I change the device from cpu to mps:

device = torch.device("mps")

a RuntimeError occurs while training the model:

RuntimeError: Expected a proper Tensor but got None (or an undefined Tensor in C++) for argument #0 'grad_y'
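For context, the failing step looks roughly like this (a minimal sketch; the model, input, and loss below are placeholders, not my actual Seq2Seq code):

import torch
import torch.nn as nn

# Use mps when available, otherwise fall back to cpu.
device = torch.device("mps") if torch.backends.mps.is_available() else torch.device("cpu")

# Placeholder model and batch, just to show where the error is raised.
model = nn.Linear(16, 4).to(device)
x = torch.randn(8, 16, device=device)
target = torch.randn(8, 4, device=device)

loss = nn.MSELoss()(model(x), target)
loss.backward()  # in the real training loop, this backward() call raises the RuntimeError above on mps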


Hi, have you found a solution?

I am having this problem too, and cannot find any information on how to solve it.

Same error here, and I just can’t find a way to fix it.
Things I have tried:

  • computing the loss on the GPU and on the CPU
  • calling retain_grad()
  • moving the output and labels to the CPU before computing the loss (see the sketch after this list)
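The last attempt looked roughly like this (a sketch with placeholder tensor names, not my exact code):

# output comes from the model running on mps, target is the label batch
output_cpu = output.to("cpu")
target_cpu = target.to("cpu")
loss = criterion(output_cpu, target_cpu)
loss.backward()  # still fails with the same RuntimeError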

Same issue here as well; it looks like it is caused by the LSTM in my network. I tried a GRU too, but to no avail. CPU is fine, and CUDA is fine on Linux.
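In case it helps anyone narrow it down, this is the kind of minimal LSTM-on-mps check I am using (a sketch; the shapes are arbitrary, and whether it reproduces the error may depend on the PyTorch build):

import torch
import torch.nn as nn

device = torch.device("mps")

# Tiny stand-alone LSTM, no Seq2Seq wrapper around it.
lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True).to(device)
x = torch.randn(4, 12, 8, device=device)

out, _ = lstm(x)
loss = out.mean()
loss.backward()  # on affected builds, this backward() call is where the error surfaces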