during the back-propagation there is this step
" return torch._C._nn.mse_loss(expanded_input, expanded_target, _Reduction.get_enum(reduction))"
and the error is
RuntimeError: iter.device(arg).is_cuda() INTERNAL ASSERT FAILED at “/opt/conda/conda-bld/pytorch_1603728993639/work/aten/src/ATen/native/cuda/Loops.cuh”:94, please report a bug to PyTorch.
Could you rerun your code with CUDA_LAUNCH_BLOCKING=1 python script.py args and check, if you get any error message?
If so, could you post it here please?
Also, make sure to use the latest PyTorch version.
You could try to use os.environ to set this env var at the beginning of the notebook.
Note, that you would have to set it before importing PyTorch, as it won’t have any effect otherwise.
Alternatively you could export the notebook as a Python script and run it in the terminal, which should work as well.
Error : “RuntimeError: iter.device(arg).is_cuda() INTERNAL ASSERT FAILED at “/opt/conda/conda-bld/pytorch_1603728993639/work/aten/src/ATen/native/cuda/Loops.cuh”:94, please report a bug to PyTorch.”
OK, I think the blocking launch didn’t work or maybe Jupyter is not forwarding the stack trace.
Could you post an executable code snippet to reproduce this issue and give us information about your setup, i.e. PyTorch CUDA, cublas version, how did you install it, which GPU are you using?