So I have working maskrcnn_benchmark working with torch==1.10.0+cu111.
Now I shift to NVIDIA H100 which are sm_90.
sm_90 requires CUDA 11.8 which is supported on torch 1.13.0 onwards.
torch 1.13.0 (or any other torch >1.11) have removed <THC/THC.h> which mean I have to patch the
maskrcnn_benchmark/csrc/cuda/
with <ATen/ATen.h>
following steps here : Missing headers in ATen/cuda/DeviceUtils.cuh · Issue #72807 · pytorch/pytorch · GitHub
But once I do all of this I get
maskrcnn_benchmark/csrc/cuda/deform_pool_kernel_cuda.cu(245): error: no instance of overloaded function "atomicAdd" matches
the argument list
argument types are: (c10::Half *, c10::Half)
detected during instantiation of "void DeformablePSROIPoolBackwardAccKernel(int, const scalar_t *, const scalar_t *, int, scalar_t, int, int, int, int, int, int, scalar_t
*, scalar_t *, const scalar_t *, const scalar_t *, const scalar_t *, int, scalar_t, int, int, int, int, int) [with scalar_t=c10::Half]"
(347): here
What do I do now