Are there tools / best practices for mixed
cuda debugging for torch CUDA Extensions (extensions built with
CUDAExtension)? I.e., I’d like to run / debug a PyTorch program from Python and then step into a debugger such as
cuda-gdb when the extension is called.
Any recommendations greatly appreciated!