This is my PyTorch version, and my GPU is an “NVIDIA GeForce RTX 4090”.
When I try to train my model, the kernel dies with the output below:
C:\actions-runner_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\cuda\IndexKernel.cu:93: block: [62,0,0], thread: [115,0,0] Assertion index >= -sizes[i] && index < sizes[i] && "index out of bounds" failed.
Any help would be appreciated!
You would need to build PyTorch from source using your locally installed CUDA toolkit. Alternatively, you can download the nightly binaries built with CUDA 12.1 Update 1.
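Separately from the CUDA version, the assertion in the log (`index >= -sizes[i] && index < sizes[i]`) is typically raised when an index used on the GPU falls outside the valid range `[-size, size)`, for example a class label that is greater than or equal to the number of classes, or a token id beyond an embedding table's size. Below is a minimal sketch for checking this before training; the helper name `find_out_of_bounds` and the sample values are hypothetical and not taken from the original post.

```python
def find_out_of_bounds(indices, size):
    """Return (position, value) pairs for indices outside [-size, size).

    Mirrors the range check in the assertion message:
    index >= -size and index < size.
    """
    return [
        (pos, idx)
        for pos, idx in enumerate(indices)
        if not (-size <= idx < size)
    ]


# Hypothetical example: a model with 5 classes, so valid labels are -5..4.
labels = [0, 4, 5, -1, -6]
bad = find_out_of_bounds(labels, size=5)
print(bad)  # [(2, 5), (4, -6)]
```

For a PyTorch tensor, something like `find_out_of_bounds(t.flatten().tolist(), size)` should work. Running the failing step on the CPU, or setting the environment variable `CUDA_LAUNCH_BLOCKING=1`, usually gives a traceback that points at the offending indexing operation.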