Compiling objects...
Allowing ninja to set a default number of workers... (overridable by setting the environment variable MAX_JOBS=N)
[1/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/cuda_utils_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/cuda_utils_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[2/10] c++ -MMD -MF /tmp/tmpnuewswtl.build-temp/csrc/pybind.o.d -pthread -B /root/anaconda3/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/anaconda3/include -fPIC -O2 -isystem /root/anaconda3/include -fPIC -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pybind.cpp -o /tmp/tmpnuewswtl.build-temp/csrc/pybind.o -g -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmpnuewswtl.build-temp/csrc/pybind.o
c++ -MMD -MF /tmp/tmpnuewswtl.build-temp/csrc/pybind.o.d -pthread -B /root/anaconda3/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /root/anaconda3/include -fPIC -O2 -isystem /root/anaconda3/include -fPIC -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pybind.cpp -o /tmp/tmpnuewswtl.build-temp/csrc/pybind.o -g -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
c++: fatal error: Killed signal terminated program cc1plus
compilation terminated.
[3/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/gptq/q_gemm.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/gptq/q_gemm.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmpnuewswtl.build-temp/csrc/quantization/gptq/q_gemm.o
/root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/gptq/q_gemm.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/gptq/q_gemm.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
Killed
Killed
Killed
Killed
Killed
Killed
[4/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/squeezellm/quant_cuda_kernel.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/squeezellm/quant_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmpnuewswtl.build-temp/csrc/quantization/squeezellm/quant_cuda_kernel.o
/root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/squeezellm/quant_cuda_kernel.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/squeezellm/quant_cuda_kernel.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
Killed
Killed
Killed
Killed
Killed
Killed
[5/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/activation_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
FAILED: /tmp/tmpnuewswtl.build-temp/csrc/activation_kernels.o
/root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/activation_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/activation_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
Killed
Killed
Killed
Killed
Killed
Killed
[6/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pos_encoding_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/pos_encoding_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0 FAILED: /tmp/tmpnuewswtl.build-temp/csrc/pos_encoding_kernels.o
/root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/pos_encoding_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/pos_encoding_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
Killed
Killed
Killed
Killed
Killed
Killed
.
.
.
[9/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/layernorm_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/layernorm_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
[10/10] /root/anaconda3/bin/nvcc -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/torch/csrc/api/include -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/TH -I/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/include/THC -I/root/anaconda3/include -I/root/anaconda3/include/python3.11 -c -c /vllm-ra/csrc/quantization/awq/gemm_kernels.cu -o /tmp/tmpnuewswtl.build-temp/csrc/quantization/awq/gemm_kernels.o -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr --compiler-options ''"'"'-fPIC'"'"'' -O2 -std=c++17 -D_GLIBCXX_USE_CXX11_ABI=0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_70,code=sm_70 --threads 8 -DTORCH_API_INCLUDE_EXTENSION_H '-DPYBIND11_COMPILER_TYPE="_gcc"' '-DPYBIND11_STDLIB="_libstdcpp"' '-DPYBIND11_BUILD_ABI="_cxxabi1011"' -DTORCH_EXTENSION_NAME=_C -D_GLIBCXX_USE_CXX11_ABI=0
/vllm-ra/csrc/quantization/awq/gemm_kernels.cu(24): warning #177-D: function "vllm::awq::__pack_half2" was declared but never referenced
__pack_half2(const half x, const half y) {
^
Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
/vllm-ra/csrc/quantization/awq/gemm_kernels.cu(40): warning #177-D: variable "scaling_factors_shared" was declared but never referenced
__attribute__((shared)) half scaling_factors_shared[128];
^
Remark: The warnings can be suppressed with "-diag-suppress <warning-number>"
/vllm-ra/csrc/quantization/awq/gemm_kernels.cu(41): warning #177-D: variable "zeros_shared" was declared but never referenced
__attribute__((shared)) half zeros_shared[128];
^
/vllm-ra/csrc/quantization/awq/gemm_kernels.cu(277): warning #177-D: variable "blockIdx_x" was declared but never referenced
int blockIdx_x = 0;
^
/vllm-ra/csrc/quantization/awq/gemm_kernels.cu(291): warning #177-D: variable "ld_zero_flag" was declared but never referenced
bool ld_zero_flag = (threadIdx.y * 32 + threadIdx.x) * 8 < 64;
^
/vllm-ra/csrc/quantization/awq/gemm_kernels.cu(24): warning #177-D: function "vllm::awq::__pack_half2" was declared but never referenced
__pack_half2(const half x, const half y) {
^
ninja: build stopped: subcommand failed.
Traceback (most recent call last):
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2100, in _run_ninja_build
subprocess.run(
File "/root/anaconda3/lib/python3.11/subprocess.py", line 571, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 155, in run
self._create_wheel_file(bdist_wheel)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 357, in _create_wheel_file
files, mapping = self._run_build_commands(dist_name, unpacked, lib, tmp)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 280, in _run_build_commands
self._run_build_subcommands()
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/editable_wheel.py", line 307, in _run_build_subcommands
self.run_command(name)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/cmd.py", line 318, in run_command
self.distribution.run_command(command)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/dist.py", line 967, in run_command super().run_command(command)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
cmd_obj.run()
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/build_ext.py", line 91, in run
_build_ext.run(self)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 345, in run
self.build_extensions()
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 873, in build_extensions
build_ext.build_extensions(self)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 467, in build_extensions
self._build_extensions_serial()
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 493, in _build_extensions_serial
self.build_extension(ext)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/build_ext.py", line 252, in build_extension
_build_ext.build_extension(self, ext)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 548, in build_extension
objects = self.compiler.compile(
^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 686, in unix_wrap_ninja_compile
_write_ninja_file_and_compile_objects(
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 1774, in _write_ninja_file_and_compile_objects
_run_ninja_build(
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2116, in _run_ninja_build
raise RuntimeError(message) from e
RuntimeError: Error compiling objects for extension
/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/dist.py:988: _DebuggingTips: Problem in editable installation.
!!
********************************************************************************
An error happened while installing `vllm` in editable mode.
The following steps are recommended to help debug this problem:
- Try to install the project normally, without using the editable mode.
Does the error still persist?
(If it does, try fixing the problem before attempting the editable mode).
- If you are using binary extensions, make sure you have all OS-level
dependencies installed (e.g. compilers, toolchains, binary libraries, ...).
- Try the latest version of setuptools (maybe the error was already fixed).
- If you (or your project dependencies) are using any setuptools extension
or customization, make sure they support the editable mode.
After following the steps above, if the problem still persists and
you think this is related to how setuptools handles editable installations,
please submit a reproducible example
(see https://stackoverflow.com/help/minimal-reproducible-example) to:
https://github.com/pypa/setuptools/issues
See https://setuptools.pypa.io/en/latest/userguide/development_mode.html for details.
********************************************************************************
!!
cmd_obj.run()
Traceback (most recent call last):
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2100, in _run_ninja_build
subprocess.run(
File "/root/anaconda3/lib/python3.11/subprocess.py", line 571, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "/root/anaconda3/lib/python3.11/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 353, in <module>
main()
File "/root/anaconda3/lib/python3.11/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py", line 335, in main
json_out['return_val'] = hook(**hook_input['kwargs'])
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.
.
.
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/command/build_ext.py", line 252, in build_extension
_build_ext.build_extension(self, ext)
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/setuptools/_distutils/command/build_ext.py", line 548, in build_extension
objects = self.compiler.compile(
^^^^^^^^^^^^^^^^^^^^^^
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 686, in unix_wrap_ninja_compile
_write_ninja_file_and_compile_objects(
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 1774, in _write_ninja_file_and_compile_objects
_run_ninja_build(
File "/tmp/pip-build-env-yipypp5p/overlay/lib/python3.11/site-packages/torch/utils/cpp_extension.py", line 2116, in _run_ninja_build
raise RuntimeError(message) from e
RuntimeError: Error compiling objects for extension
error: subprocess-exited-with-error
× Building editable for vllm (pyproject.toml) did not run successfully.
│ exit code: 1
╰─> See above for output.
note: This error originates from a subprocess, and is likely not a problem with pip.
full command: /root/anaconda3/bin/python3 /root/anaconda3/lib/python3.11/site-packages/pip/_vendor/pyproject_hooks/_in_process/_in_process.py build_editable /tmp/tmpmib0bf78
cwd: /vllm-ra
Building editable for vllm (pyproject.toml): finished with status 'error'
ERROR: Failed building editable for vllm
Building wheel for quantile-python (setup.py): started
Running command python setup.py bdist_wheel
running bdist_wheel
running build
running build_py
creating build
creating build/lib
creating build/lib/quantile
copying quantile/__init__.py -> build/lib/quantile
/root/anaconda3/lib/python3.11/site-packages/setuptools/_distutils/cmd.py:66: SetuptoolsDeprecationWarning: setup.py install is deprecated.
!!
!!
self.initialize_options()
installing to build/bdist.linux-x86_64/wheel
running install
running install_lib
creating build/bdist.linux-x86_64
creating build/bdist.linux-x86_64/wheel
creating build/bdist.linux-x86_64/wheel/quantile
copying build/lib/quantile/__init__.py -> build/bdist.linux-x86_64/wheel/quantile
running install_egg_info
running egg_info
writing quantile_python.egg-info/PKG-INFO
writing dependency_links to quantile_python.egg-info/dependency_links.txt
writing top-level names to quantile_python.egg-info/top_level.txt
reading manifest file 'quantile_python.egg-info/SOURCES.txt'
writing manifest file 'quantile_python.egg-info/SOURCES.txt'
Copying quantile_python.egg-info to build/bdist.linux-x86_64/wheel/quantile_python-1.1-py3.11.egg-info
running install_scripts
creating build/bdist.linux-x86_64/wheel/quantile_python-1.1.dist-info/WHEEL
creating '/tmp/pip-wheel-w_wu8np5/quantile_python-1.1-py3-none-any.whl' and adding 'build/bdist.linux-x86_64/wheel' to it
adding 'quantile/__init__.py'
adding 'quantile_python-1.1.dist-info/METADATA'
adding 'quantile_python-1.1.dist-info/WHEEL'
adding 'quantile_python-1.1.dist-info/top_level.txt'
adding 'quantile_python-1.1.dist-info/RECORD'
removing build/bdist.linux-x86_64/wheel
Building wheel for quantile-python (setup.py): finished with status 'done'
Created wheel for quantile-python: filename=quantile_python-1.1-py3-none-any.whl size=3442 sha256=0240ef85c160b3877c9a728b8e9272ae4375e2ab04d6124a9fb1d0c00626f4e0
Stored in directory: /root/.cache/pip/wheels/67/a2/17/29e7169adf03a7e44b922abb6a42c2c1b0fda11f7bfbdb24a2
Successfully built quantile-python
Failed to build vllm
ERROR: Could not build wheels for vllm, which is required to install pyproject.toml-based projects
Killed
usually indicates a SIGKILL
signal sent by the OS if you are running out of memory. Reduce the number of jobs and try to rebuild PyTorch.