Facing following error when building a library with CUDA from source

  Compiling objects...
  Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
  [1/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/cuda_utils_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/cuda_utils_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  [2/10] c++ -MMD -MF /tmp/tmpnuewswtl.build-temp/csrc/pybind.o.d -pthread -B /root/anaconda3/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/anaconda3/include -fPIC -O2 -isystem /root/anaconda3/include -fPIC -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pybind.cpp -o /tmp/tmpnuewswtl.build-temp/csrc/pybind.o -g -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  FAILED: /tmp/tmpnuewswtl.build-temp/csrc/pybind.o
  c++ -MMD -MF /tmp/tmpnuewswtl.build-temp/csrc/pybind.o.d -pthread -B /root/anaconda3/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/anaconda3/include -fPIC -O2 -isystem /root/anaconda3/include -fPIC -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pybind.cpp -o /tmp/tmpnuewswtl.build-temp/csrc/pybind.o -g -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  c++: fatal error: Killed signal terminated program cc1plus
  compilation terminated.
  [3/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/gptq/q_gemm.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/gptq/q_gemm.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  FAILED: /tmp/tmpnuewswtl.build-temp/csrc/quantization/gptq/q_gemm.o
  /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/gptq/q_gemm.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/gptq/q_gemm.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  Killed
  Killed
  Killed
  Killed
  Killed
  Killed
  [4/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/squeezellm/quant_cuda_kernel.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/squeezellm/quant_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  FAILED: /tmp/tmpnuewswtl.build-temp/csrc/quantization/squeezellm/quant_cuda_kernel.o
  /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/squeezellm/quant_cuda_kernel.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/squeezellm/quant_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  Killed
  Killed
  Killed
  Killed
  Killed
  Killed
  [5/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/activation_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  FAILED: /tmp/tmpnuewswtl.build-temp/csrc/activation_kernels.o
  /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/activation_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  Killed
  Killed
  Killed
  Killed
  Killed
  Killed
  [6/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pos_encoding_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/pos_encoding_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0  FAILED: /tmp/tmpnuewswtl.build-temp/csrc/pos_encoding_kernels.o
  /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pos_encoding_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/pos_encoding_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  Killed
  Killed
  Killed
  Killed
  Killed
  Killed
 .
 .
 .
  [9/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/layernorm_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/layernorm_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  [10/10] /root/anaconda3/bin/nvcc  -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/awq/gemm_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/awq/gemm_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
  /vllm-ra/csrc/quantization/awq/gemm_kernels.cu(24): warning #177-D: function "vllm::awq::__pack_half2" was declared but never referenced
    __pack_half2(const half x, const half y) {
    ^

  Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"

  /vllm-ra/csrc/quantization/awq/gemm_kernels.cu(40): warning #177-D: variable "scaling_factors_shared" was declared but never referenced
      __attribute__((shared)) half scaling_factors_shared[128];
                                   ^

  Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"

  /vllm-ra/csrc/quantization/awq/gemm_kernels.cu(41): warning #177-D: variable "zeros_shared" was declared but never referenced
      __attribute__((shared)) half zeros_shared[128];
                                   ^

  /vllm-ra/csrc/quantization/awq/gemm_kernels.cu(277): warning #177-D: variable "blockIdx_x" was declared but never referenced
      int blockIdx_x = 0;
          ^

  /vllm-ra/csrc/quantization/awq/gemm_kernels.cu(291): warning #177-D: variable "ld_zero_flag" was declared but never referenced
      bool ld_zero_flag = (threadIdx.y * 32 + threadIdx.x) * 8 < 64;
           ^

  /vllm-ra/csrc/quantization/awq/gemm_kernels.cu(24): warning #177-D: function "vllm::awq::__pack_half2" was declared but never referenced
    __pack_half2(const half x, const half y) {
    ^

  ninja: build stopped: subcommand failed.
  Traceback (most recent call last):
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2100, in _run_ninja_build
      subprocess.run(
    File "/root/anaconda3/lib/python3.11/subprocess.py", line 571, in run
      raise CalledProcessError(retcode, process.args,
  subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.

  The above exception was the direct cause of the following exception:

  Traceback (most recent call last):
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 155, in run
      self._create_wheel_file(bdist_wheel)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 357, in _create_wheel_file
      files, mapping = self._run_build_commands(dist_name, unpacked, lib, tmp)
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 280, in _run_build_commands
      self._run_build_subcommands()
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 307, in _run_build_subcommands
      self.run_command(name)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/cmd.py", line 318, in run_command
      self.distribution.run_command(command)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/dist.py", line 967, in run_command      super().run_command(command)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
      cmd_obj.run()
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/build_ext.py", line 91, in run
      _build_ext.run(self)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 345, in run
      self.build_extensions()
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 873, in build_extensions
      build_ext.build_extensions(self)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 467, in build_extensions
      self._build_extensions_serial()
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 493, in _build_extensions_serial
      self.build_extension(ext)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/build_ext.py", line 252, in build_extension
      _build_ext.build_extension(self, ext)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 548, in build_extension
      objects = self.compiler.compile(
                ^^^^^^^^^^^^^^^^^^^^^^
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 686, in unix_wrap_ninja_compile
      _write_ninja_file_and_compile_objects(
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 1774, in _write_ninja_file_and_compile_objects
      _run_ninja_build(
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2116, in _run_ninja_build
      raise RuntimeError(message) from e
  RuntimeError: Error compiling objects for extension
  /tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/dist.py:988: _DebuggingTips: Problem in editable installation.
  !!

          ********************************************************************************
          An error happened while installing `vllm` in editable mode.

          The following steps are recommended to help debug this problem:

          - Try to install the project normally, without using the editable mode.
            Does the error still persist?
            (If it does, try fixing the problem before attempting the editable mode).
          - If you are using binary extensions, make sure you have all OS-level
            dependencies installed (e.g. compilers, toolchains, binary libraries, ...).
          - Try the latest version of setuptools (maybe the error was already fixed).
          - If you (or your project dependencies) are using any setuptools extension
            or customization, make sure they support the editable mode.

          After following the steps above, if the problem still persists and
          you think this is related to how setuptools handles editable installations,
          please submit a reproducible example
          (see https://stackoverflow.com/help/minimal-reproducible-example) to:

              https://github.com/pypa/setuptools/issues

          See https://setuptools.pypa.io/en/latest/userguide/development_mode.html for details.
          ********************************************************************************

  !!
    cmd_obj.run()
  Traceback (most recent call last):
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2100, in _run_ninja_build
      subprocess.run(
    File "/root/anaconda3/lib/python3.11/subprocess.py", line 571, in run
      raise CalledProcessError(retcode, process.args,
  subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.

  The above exception was the direct cause of the following exception:

  Traceback (most recent call last):
    File "/root/anaconda3/lib/python3.11/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 353, in <module>
      main()
    File "/root/anaconda3/lib/python3.11/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 335, in main
      json_out['return_val'] = hook(**hook_input['kwargs'])
                               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    .
    .
    .
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/build_ext.py", line 252, in build_extension
      _build_ext.build_extension(self, ext)
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 548, in build_extension
      objects = self.compiler.compile(
                ^^^^^^^^^^^^^^^^^^^^^^
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 686, in unix_wrap_ninja_compile
      _write_ninja_file_and_compile_objects(
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 1774, in _write_ninja_file_and_compile_objects
      _run_ninja_build(
    File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2116, in _run_ninja_build
      raise RuntimeError(message) from e
  RuntimeError: Error compiling objects for extension
  error: subprocess-exited-with-error

  × Building editable for vllm (pyproject.toml) did not run successfully.
  │ exit code: 1
  ╰─> See above for output.

  note: This error originates from a subprocess, and is likely not a problem with pip.
  full command: /root/anaconda3/bin/python3 /root/anaconda3/lib/python3.11/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py build_editable /tmp/tmpmib0bf78
  cwd: /vllm-ra
  Building editable for vllm (pyproject.toml): finished with status 'error'
  ERROR: Failed building editable for vllm
  Building wheel for quantile-python (setup.py): started
  Running command python setup.py bdist_wheel
  running bdist_wheel
  running build
  running build_py
  creating build
  creating build/lib
  creating build/lib/quantile
  copying quantile/__init__.py -> build/lib/quantile
  /root/anaconda3/lib/python3.11/site-packages/setuptools/_distutils/cmd.py:66: SetuptoolsDeprecationWarning: setup.py install is deprecated.
  !!
  !!
    self.initialize_options()
  installing to build/bdist.linux-x86_64/wheel
  running install
  running install_lib
  creating build/bdist.linux-x86_64
  creating build/bdist.linux-x86_64/wheel
  creating build/bdist.linux-x86_64/wheel/quantile
  copying build/lib/quantile/__init__.py -> build/bdist.linux-x86_64/wheel/quantile
  running install_egg_info
  running egg_info
  writing quantile_python.egg-info/PKG-INFO
  writing dependency_links to quantile_python.egg-info/dependency_links.txt
  writing top-level names to quantile_python.egg-info/top_level.txt
  reading manifest file 'quantile_python.egg-info/SOURCES.txt'
  writing manifest file 'quantile_python.egg-info/SOURCES.txt'
  Copying quantile_python.egg-info to build/bdist.linux-x86_64/wheel/quantile_python-1.1-py3.11.egg-info
  running install_scripts
  creating build/bdist.linux-x86_64/wheel/quantile_python-1.1.dist-info/WHEEL
  creating '/tmp/pip-wheel-w_wu8np5/quantile_python-1.1-py3-none-any.whl' and adding 'build/bdist.linux-x86_64/wheel' to it
  adding 'quantile/__init__.py'
  adding 'quantile_python-1.1.dist-info/METADATA'
  adding 'quantile_python-1.1.dist-info/WHEEL'
  adding 'quantile_python-1.1.dist-info/top_level.txt'
  adding 'quantile_python-1.1.dist-info/RECORD'
  removing build/bdist.linux-x86_64/wheel
  Building wheel for quantile-python (setup.py): finished with status 'done'
  Created wheel for quantile-python: filename=quantile_python-1.1-py3-none-any.whl size=3442 sha256=0240ef85c160b3877c9a728b8e9272ae4375e2ab04d6124a9fb1d0c00626f4e0
  Stored in directory: /root/.cache/pip/wheels/67/a2/17/29e7169adf03a7e44b922abb6a42c2c1b0fda11f7bfbdb24a2
Successfully built quantile-python
Failed to build vllm
ERROR: Could not build wheels for vllm, which is required to install pyproject.toml-based projects

Killed usually indicates a SIGKILL signal sent by the OS if you are running out of memory. Reduce the number of jobs and try to rebuild PyTorch.