Can't convert CUDA tensor to numpy. Use Tensor.cpu() to copy the tensor to host memory first
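
For anyone landing here with the same message, here is a minimal sketch of the fix the error itself suggests: copy the tensor to host memory with `Tensor.cpu()` before calling `.numpy()`. The helper name `to_numpy` and the sample tensor `x` are made up for illustration and are not from the original post. The `detach()` call is an extra assumption, only needed when the tensor requires grad.

```python
import numpy as np
import torch


def to_numpy(t: torch.Tensor) -> np.ndarray:
    """Return `t` as a NumPy array, copying it to host memory first if it lives on the GPU."""
    # detach() drops autograd tracking; cpu() is a no-op for tensors already on the host.
    return t.detach().cpu().numpy()


# Use the GPU when one is available so the sketch also runs on CPU-only machines.
device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.arange(6, dtype=torch.float32, device=device).reshape(2, 3)

# Calling x.numpy() directly raises the error in the title when x is on a CUDA device.
arr = to_numpy(x)
```

If the data is needed on the GPU again afterwards, `torch.from_numpy(arr).to(device)` moves it back.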

Double post from here.