Imported target "torch::nvtoolsext" includes non-existent path Imported target "torch::nvtoolsext" includes non-existent path

netcorefan1 · February 28, 2023, 4:53am

Hello,
I’m trying to compile PT from source in order to get a build that I only needs for Windows, but it fails by complaining a non existent path:
> "C:/Program Files/NVIDIA Corporation/NvToolsExt/include"

System has a full Cuda 12 Sdk toolkit and cudnn, but the toolkit does not install any NvToolsExt inside NVIDIA Corporation folder.
I installed the missing tools with conda install -c conda-forge nvtx, but cmake don’t recognize any difference and return the same error.

(pytorchenv) PS ..\Downloads\pytorch> python setup.py develop
Building wheel torch-2.0.0a0+git801b3f8
-- Building version 2.0.0a0+git801b3f8
cmake -GNinja -DBUILD_PYTHON=True -DBUILD_TEST=True -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX=..\Downloads\pytorch\torch -DCMAKE_PREFIX_PATH=..\anaconda3\envs\pytorchenv\Lib\site-packages -DNUMPY_INCLUDE_DIR=..\anaconda3\envs\pytorchenv\lib\site-packages\numpy\core\include -DPYTHON_EXECUTABLE=..\anaconda3\envs\pytorchenv\python.exe -DPYTHON_INCLUDE_DIR=..\anaconda3\envs\pytorchenv\Include -DPYTHON_LIBRARY=..\anaconda3\envs\pytorchenv/libs/python310.lib -DTORCH_BUILD_VERSION=2.0.0a0+git801b3f8 -DUSE_NUMPY=True ..\Downloads\pytorch
-- The CXX compiler identification is MSVC 19.34.31937.0
-- The C compiler identification is MSVC 19.34.31937.0
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe - skipped
-- Detecting CXX compile features - done
-- Detecting C compiler ABI info - done
-- Check for working C compiler: ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe - skipped
-- Detecting C compile features - done
-- Not forcing any particular BLAS to be found
CMake Warning at CMakeLists.txt:368 (message):
TensorPipe cannot be used on Windows.  Set it to OFF
-- Performing Test COMPILER_WORKS - Success
-- Performing Test CAFFE2_NEED_TO_TURN_OFF_DEPRECATION_WARNING - Failed
-- Performing Test C_HAS_AVX_1 - Success
-- Performing Test C_HAS_AVX2_1 - Success
-- Performing Test C_HAS_AVX512_1 - Failed
-- Performing Test C_HAS_AVX512_2 - Failed
-- Performing Test C_HAS_AVX512_3 - Failed
-- Performing Test CXX_HAS_AVX_1 - Success
-- Performing Test CXX_HAS_AVX2_1 - Success
-- Performing Test CXX_HAS_AVX512_1 - Failed
-- Performing Test CXX_HAS_AVX512_2 - Failed
-- Performing Test CXX_HAS_AVX512_3 - Failed
-- Current compiler supports avx2 extension. Will build perfkernels.
-- Performing Test CAFFE2_COMPILER_SUPPORTS_AVX512_EXTENSIONS - Success
-- Current compiler supports avx512f extension. Will build fbgemm.
-- Performing Test COMPILER_SUPPORTS_HIDDEN_VISIBILITY - Failed
-- Performing Test COMPILER_SUPPORTS_HIDDEN_INLINE_VISIBILITY - Failed
-- Found CUDA: ../NVIDIA GPU Computing Toolkit/CUDA/v12.0 (found version "12.0")
-- The CUDA compiler identification is NVIDIA 12.0.140
-- Detecting CUDA compiler ABI info - done
-- Check for working CUDA compiler: ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/bin/nvcc.exe - skipped
-- Detecting CUDA compile features - done
-- Caffe2: CUDA detected: 12.0
-- Caffe2: CUDA nvcc is: ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/bin/nvcc.exe
-- Caffe2: CUDA toolkit directory: ../NVIDIA GPU Computing Toolkit/CUDA/v12.0
-- Caffe2: Header version is: 12.0
-- ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/nvrtc.lib shorthash is bac8224f
-- Found CUDNN: ../NVIDIA/CUDNN/v8.8/lib/x64/cudnn.lib
-- Autodetected CUDA architecture(s):  8.6
-- Added CUDA NVCC flags for: -gencode;arch=compute_86,code=sm_86
-- Building using own protobuf under third_party per request.
-- Use custom protobuf build.
--
-- 3.13.0.0
-- Looking for pthread.h - not found
-- Found Threads: TRUE
-- Caffe2 protobuf include directory: $<BUILD_INTERFACE:../third_party/protobuf/src>$<INSTALL_INTERFACE:include>
-- Trying to find preferred BLAS backend of choice: MKL
-- MKL_THREADING = OMP
-- Looking for sys/types.h - found
-- Looking for stdint.h - found
-- Looking for stddef.h - found
-- Check size of void* - done
-- MKL_THREADING = OMP
CMake Warning at cmake/Dependencies.cmake:187 (message):
MKL could not be found.  Defaulting to Eigen
Call Stack (most recent call first):
CMakeLists.txt:710 (include)
CMake Warning at cmake/Dependencies.cmake:224 (message):
Preferred BLAS (MKL) cannot be found, now searching for a general BLAS
library
Call Stack (most recent call first):
CMakeLists.txt:710 (include)
-- MKL_THREADING = OMP
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core - libiomp5md]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core - libiomp5md]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_sequential - mkl_core]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_sequential - mkl_core]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - libiomp5md - pthread]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - libiomp5md - pthread]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - pthread]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - pthread]
--   Library mkl_intel: not found
-- Checking for [mkl - guide - pthread - m]
--   Library mkl: not found
-- MKL library not found
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Checking for [Accelerate]
--   Library Accelerate: BLAS_Accelerate_LIBRARY-NOTFOUND
-- Checking for [vecLib]
--   Library vecLib: BLAS_vecLib_LIBRARY-NOTFOUND
-- Checking for [flexiblas]
--   Library flexiblas: BLAS_flexiblas_LIBRARY-NOTFOUND
-- Checking for [openblas]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [openblas - pthread - m]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [openblas - pthread - m - gomp]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [libopenblas]
--   Library libopenblas: BLAS_libopenblas_LIBRARY-NOTFOUND
-- Checking for [goto2 - gfortran]
--   Library goto2: BLAS_goto2_LIBRARY-NOTFOUND
-- Checking for [goto2 - gfortran - pthread]
--   Library goto2: BLAS_goto2_LIBRARY-NOTFOUND
-- Checking for [acml - gfortran]
--   Library acml: BLAS_acml_LIBRARY-NOTFOUND
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Could NOT find Atlas (missing: Atlas_CBLAS_INCLUDE_DIR Atlas_CLAPACK_INCLUDE_DIR Atlas_CBLAS_LIBRARY Atlas_BLAS_LIBRARY Atlas_LAPACK_LIBRARY)
-- Checking for [ptf77blas - atlas - gfortran]
--   Library ptf77blas: BLAS_ptf77blas_LIBRARY-NOTFOUND
-- Checking for []
-- Looking for sgemm_ - not found
-- Cannot find a library with BLAS API. Not using BLAS.
-- Using pocketfft in directory: ../third_party/pocketfft/
-- The ASM compiler identification is MSVC
-- Found assembler: ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe
-- Found Python: ../envs/pytorchenv/python.exe (found version "3.10.9") found components: Interpreter
-- Found Git: ../Git/cmd/git.exe (found version "2.38.1.windows.1")
-- git version: v1.6.1 normalized to 1.6.1
-- Version: 1.6.1
-- Looking for shm_open in rt - not found
-- Performing Test HAVE_STD_REGEX -- success
-- Performing Test HAVE_GNU_POSIX_REGEX -- failed to compile
-- Performing Test HAVE_POSIX_REGEX -- failed to compile
-- Performing Test HAVE_STEADY_CLOCK -- success
-- Found PythonInterp: ../envs/pytorchenv/python.exe (found version "3.10.9")
-- Performing Test COMPILER_SUPPORTS_AVX512 - Success
-- MKL_THREADING = OMP
CMake Warning (dev) at ../envs/pytorchenv/Library/share/cmake-3.22/Modules/FindPackageHandleStandardArgs.cmake:438 (message):
The package name passed to `find_package_handle_standard_args` (OpenMP_C)
does not match the name of the calling package (OpenMP).  This can lead to
problems in calling code that expects `find_package` result variables
(e.g., `_FOUND`) to follow a certain pattern.
Call Stack (most recent call first):
cmake/Modules/FindOpenMP.cmake:584 (find_package_handle_standard_args)
third_party/fbgemm/CMakeLists.txt:85 (find_package)
This warning is for project developers.  Use -Wno-dev to suppress it.
-- Found OpenMP_C: -openmp:experimental
CMake Warning (dev) at ../envs/pytorchenv/Library/share/cmake-3.22/Modules/FindPackageHandleStandardArgs.cmake:438 (message):
The package name passed to `find_package_handle_standard_args` (OpenMP_CXX)
does not match the name of the calling package (OpenMP).  This can lead to
problems in calling code that expects `find_package` result variables
(e.g., `_FOUND`) to follow a certain pattern.
Call Stack (most recent call first):
cmake/Modules/FindOpenMP.cmake:584 (find_package_handle_standard_args)
third_party/fbgemm/CMakeLists.txt:85 (find_package)
This warning is for project developers.  Use -Wno-dev to suppress it.
-- Found OpenMP_CXX: -openmp:experimental
-- Found OpenMP: TRUE
CMake Warning at third_party/fbgemm/CMakeLists.txt:87 (message):
OpenMP found! OpenMP_C_INCLUDE_DIRS =
CMake Warning at third_party/fbgemm/CMakeLists.txt:186 (message):
==========
CMake Warning at third_party/fbgemm/CMakeLists.txt:187 (message):
CMAKE_BUILD_TYPE = Release
CMake Warning at third_party/fbgemm/CMakeLists.txt:188 (message):
CMAKE_CXX_FLAGS_DEBUG is /Z7 /Ob0 /Od /RTC1 /bigobj
CMake Warning at third_party/fbgemm/CMakeLists.txt:189 (message):
CMAKE_CXX_FLAGS_RELEASE is /O2 /Ob2 /DNDEBUG /bigobj
CMake Warning at third_party/fbgemm/CMakeLists.txt:190 (message):
==========
** AsmJit Summary **
ASMJIT_DIR=../third_party/fbgemm/third_party/asmjit
ASMJIT_TEST=FALSE
ASMJIT_TARGET_TYPE=SHARED
ASMJIT_DEPS=
ASMJIT_LIBS=asmjit
ASMJIT_CFLAGS=
ASMJIT_PRIVATE_CFLAGS=-MP;-GF;-Zc:__cplusplus;-Zc:inline;-Zc:strictStrings;-Zc:threadSafeInit-;-W4
ASMJIT_PRIVATE_CFLAGS_DBG=-GS
ASMJIT_PRIVATE_CFLAGS_REL=-GS-;-O2;-Oi
-- Using third party subdirectory Eigen.
-- Found PythonInterp: ../envs/pytorchenv/python.exe (found suitable version "3.10.9", minimum required is "3.0")
-- Found PythonLibs: ../envs/pytorchenv/libs/python310.lib (found suitable version "3.10.9", minimum required is "3.0")
-- Using third_party/pybind11.
-- pybind11 include dirs: ../cmake/../third_party/pybind11/include
-- Could NOT find MPI_C (missing: MPI_C_LIB_NAMES MPI_C_HEADER_DIR MPI_C_WORKS)
-- Could NOT find MPI_CXX (missing: MPI_CXX_LIB_NAMES MPI_CXX_HEADER_DIR MPI_CXX_WORKS)
-- Could NOT find MPI (missing: MPI_C_FOUND MPI_CXX_FOUND)
CMake Warning at cmake/Dependencies.cmake:1140 (message):
Not compiling with MPI.  Suppress this warning with -DUSE_MPI=OFF
Call Stack (most recent call first):
CMakeLists.txt:710 (include)
-- Adding OpenMP CXX_FLAGS: -openmp:experimental
-- Found CUB: ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/include
-- Converting CMAKE_CUDA_FLAGS to CUDA_NVCC_FLAGS:
CUDA_NVCC_FLAGS                = -Xcompiler;/w;-w;-Xcompiler;/FS;-Xfatbin;-compress-all;-DONNX_NAMESPACE=onnx_torch;--use-local-env;-gencode;arch=compute_86,code=sm_86;-Xcudafe;--diag_suppress=cc_clobber_ignored,--diag_suppress=integer_sign_change,--diag_suppress=useless_using_declaration,--diag_suppress=set_but_not_used,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=implicit_return_from_non_void_function,--diag_suppress=unsigned_compare_with_zero,--diag_suppress=declared_but_not_referenced,--diag_suppress=bad_friend_decl;--Werror;cross-execution-space-call;--no-host-device-move-forward;--expt-relaxed-constexpr;--expt-extended-lambda
CUDA_NVCC_FLAGS_DEBUG          = -Xcompiler=-Zi,-Ob0,-Od,/RTC1;-Xcompiler;/MDd
CUDA_NVCC_FLAGS_RELEASE        = -Xcompiler=-O2,-Ob2;-DNDEBUG;-Xcompiler;/MD
CUDA_NVCC_FLAGS_RELWITHDEBINFO = -Xcompiler=-Zi,-O2,-Ob1;-DNDEBUG;-Xcompiler;/MD
CUDA_NVCC_FLAGS_MINSIZEREL     = -Xcompiler=-O1,-Ob1;-DNDEBUG;-Xcompiler;/MD
CMake Warning (dev) at third_party/gloo/CMakeLists.txt:21 (option):
Policy CMP0077 is not set: option() honors normal variables.  Run "cmake
--help-policy CMP0077" for policy details.  Use the cmake_policy command to
set the policy and suppress this warning.
For compatibility with older versions of CMake, option is clearing the
normal variable 'BUILD_BENCHMARK'.
This warning is for project developers.  Use -Wno-dev to suppress it.
CMake Warning (dev) at third_party/gloo/CMakeLists.txt:35 (option):
Policy CMP0077 is not set: option() honors normal variables.  Run "cmake
--help-policy CMP0077" for policy details.  Use the cmake_policy command to
set the policy and suppress this warning.
For compatibility with older versions of CMake, option is clearing the
normal variable 'USE_NCCL'.
This warning is for project developers.  Use -Wno-dev to suppress it.
CMake Warning (dev) at third_party/gloo/CMakeLists.txt:36 (option):
Policy CMP0077 is not set: option() honors normal variables.  Run "cmake
--help-policy CMP0077" for policy details.  Use the cmake_policy command to
set the policy and suppress this warning.
For compatibility with older versions of CMake, option is clearing the
normal variable 'USE_RCCL'.
This warning is for project developers.  Use -Wno-dev to suppress it.
-- MSVC detected
-- Set USE_REDIS OFF
-- Set USE_IBVERBS OFF
-- Set USE_NCCL OFF
-- Set USE_RCCL OFF
-- Set USE_LIBUV ON
-- Only USE_LIBUV is supported on Windows
-- Gloo build as SHARED library
CMake Warning (dev) at cmake/Modules_CUDA_fix/upstream/FindCUDA.cmake:547 (if):
Policy CMP0054 is not set: Only interpret if() arguments as variables or
keywords when unquoted.  Run "cmake --help-policy CMP0054" for policy
details.  Use the cmake_policy command to set the policy and suppress this
warning.
Quoted variables like "MSVC" will no longer be dereferenced when the policy
is set to NEW.  Since the policy is not set the OLD behavior will be used.
Call Stack (most recent call first):
cmake/Modules_CUDA_fix/FindCUDA.cmake:11 (include)
third_party/gloo/cmake/Cuda.cmake:122 (find_package)
third_party/gloo/cmake/Dependencies.cmake:115 (include)
third_party/gloo/CMakeLists.txt:111 (include)
This warning is for project developers.  Use -Wno-dev to suppress it.
-- Found CUDA: ../NVIDIA GPU Computing Toolkit/CUDA/v12.0 (found suitable version "12.0", minimum required is "7.0")
-- CUDA detected: 12.0
CMake Warning at cmake/Dependencies.cmake:1461 (message):
Metal is only used in ios builds.
Call Stack (most recent call first):
CMakeLists.txt:710 (include)
-- Found PythonInterp: ../envs/pytorchenv/python.exe (found version "3.10.9")
Generated: ../build/third_party/onnx/onnx/onnx_onnx_torch-ml.proto
Generated: ../build/third_party/onnx/onnx/onnx-operators_onnx_torch-ml.proto
Generated: ../build/third_party/onnx/onnx/onnx-data_onnx_torch.proto
--
-- ******** Summary ********
--   CMake version             : 3.22.1
--   CMake command             : ../envs/pytorchenv/Library/bin/cmake.exe
--   System                    : Windows
--   C++ compiler              : ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe
--   C++ compiler version      : 19.34.31937.0
--   CXX flags                 : /DWIN32 /D_WINDOWS /GR /EHsc /bigobj /FS -DUSE_PTHREADPOOL /std:c++17 /EHsc /wd26812
--   Build type                : Release
--   Compile definitions       : ONNX_ML=1;ONNXIFI_ENABLE_EXT=1;__STDC_FORMAT_MACROS
--   CMAKE_PREFIX_PATH         : ..\anaconda3\envs\pytorchenv\Lib\site-packages;../NVIDIA GPU Computing Toolkit/CUDA/v12.0
--   CMAKE_INSTALL_PREFIX      : ../torch
--   CMAKE_MODULE_PATH         : ../cmake/Modules;../cmake/public/../Modules_CUDA_fix
--
--   ONNX version              : 1.13.1rc2
--   ONNX NAMESPACE            : onnx_torch
--   ONNX_USE_LITE_PROTO       : OFF
--   USE_PROTOBUF_SHARED_LIBS  : OFF
--   Protobuf_USE_STATIC_LIBS  : ON
--   ONNX_DISABLE_EXCEPTIONS   : OFF
--   ONNX_WERROR               : OFF
--   ONNX_BUILD_TESTS          : OFF
--   ONNX_BUILD_BENCHMARKS     : OFF
--
--   Protobuf compiler         :
--   Protobuf includes         :
--   Protobuf libraries        :
--   BUILD_ONNX_PYTHON         : OFF
--
-- ******** Summary ********
--   CMake version         : 3.22.1
--   CMake command         : ../envs/pytorchenv/Library/bin/cmake.exe
--   System                : Windows
--   C++ compiler          : ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe
--   C++ compiler version  : 19.34.31937.0
--   CXX flags             : /DWIN32 /D_WINDOWS /GR /EHsc /bigobj /FS -DUSE_PTHREADPOOL
--   Build type            : Release
--   Compile definitions   : ONNX_ML=1;ONNXIFI_ENABLE_EXT=1
--   CMAKE_PREFIX_PATH     : ..\anaconda3\envs\pytorchenv\Lib\site-packages;../NVIDIA GPU Computing Toolkit/CUDA/v12.0
--   CMAKE_INSTALL_PREFIX  : ../torch
--   CMAKE_MODULE_PATH     : ../cmake/Modules;../cmake/public/../Modules_CUDA_fix
--
--   ONNX version          : 1.4.1
--   ONNX NAMESPACE        : onnx_torch
--   ONNX_BUILD_TESTS      : OFF
--   ONNX_BUILD_BENCHMARKS : OFF
--   ONNX_USE_LITE_PROTO   : OFF
--   ONNXIFI_DUMMY_BACKEND :
--
--   Protobuf compiler     :
--   Protobuf includes     :
--   Protobuf libraries    :
--   BUILD_ONNX_PYTHON     : OFF
-- Found CUDA with FP16 support, compiling with torch.cuda.HalfTensor
-- Adding -DNDEBUG to compile flags
-- Checking prototype magma_get_sgeqrf_nb for MAGMA_V2
-- Checking prototype magma_get_sgeqrf_nb for MAGMA_V2 - False
CMake Warning at cmake/Dependencies.cmake:1650 (message):
Not compiling with MAGMA.  Suppress this warning with -DUSE_MAGMA=OFF.
Call Stack (most recent call first):
CMakeLists.txt:710 (include)
-- Could not find hardware support for NEON on this machine.
-- No OMAP3 processor on this machine.
-- No OMAP4 processor on this machine.
-- MKL_THREADING = OMP
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core - libiomp5md]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core - libiomp5md]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_sequential - mkl_core]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_sequential - mkl_core]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - libiomp5md - pthread]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - libiomp5md - pthread]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - pthread]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - pthread]
--   Library mkl_intel: not found
-- Checking for [mkl - guide - pthread - m]
--   Library mkl: not found
-- MKL library not found
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Checking for [Accelerate]
--   Library Accelerate: BLAS_Accelerate_LIBRARY-NOTFOUND
-- Checking for [vecLib]
--   Library vecLib: BLAS_vecLib_LIBRARY-NOTFOUND
-- Checking for [flexiblas]
--   Library flexiblas: BLAS_flexiblas_LIBRARY-NOTFOUND
-- Checking for [openblas]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [openblas - pthread - m]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [openblas - pthread - m - gomp]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [libopenblas]
--   Library libopenblas: BLAS_libopenblas_LIBRARY-NOTFOUND
-- Checking for [goto2 - gfortran]
--   Library goto2: BLAS_goto2_LIBRARY-NOTFOUND
-- Checking for [goto2 - gfortran - pthread]
--   Library goto2: BLAS_goto2_LIBRARY-NOTFOUND
-- Checking for [acml - gfortran]
--   Library acml: BLAS_acml_LIBRARY-NOTFOUND
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Could NOT find Atlas (missing: Atlas_CBLAS_INCLUDE_DIR Atlas_CLAPACK_INCLUDE_DIR Atlas_CBLAS_LIBRARY Atlas_BLAS_LIBRARY Atlas_LAPACK_LIBRARY)
-- Checking for [ptf77blas - atlas - gfortran]
--   Library ptf77blas: BLAS_ptf77blas_LIBRARY-NOTFOUND
-- Checking for []
-- Cannot find a library with BLAS API. Not using BLAS.
-- LAPACK requires BLAS
-- Cannot find a library with LAPACK API. Not using LAPACK.
disabling ROCM because NOT USE_ROCM is set
-- MIOpen not found. Compiling without MIOpen support
-- MKL_THREADING = OMP
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core - libiomp5md]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core - libiomp5md]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_sequential - mkl_core]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_sequential - mkl_core]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - libiomp5md - pthread]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - libiomp5md - pthread]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - pthread]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - pthread]
--   Library mkl_intel: not found
-- Checking for [mkl - guide - pthread - m]
--   Library mkl: not found
-- MKL library not found
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Checking for [Accelerate]
--   Library Accelerate: BLAS_Accelerate_LIBRARY-NOTFOUND
-- Checking for [vecLib]
--   Library vecLib: BLAS_vecLib_LIBRARY-NOTFOUND
-- Checking for [flexiblas]
--   Library flexiblas: BLAS_flexiblas_LIBRARY-NOTFOUND
-- Checking for [openblas]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [openblas - pthread - m]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [openblas - pthread - m - gomp]
--   Library openblas: BLAS_openblas_LIBRARY-NOTFOUND
-- Checking for [libopenblas]
--   Library libopenblas: BLAS_libopenblas_LIBRARY-NOTFOUND
-- Checking for [goto2 - gfortran]
--   Library goto2: BLAS_goto2_LIBRARY-NOTFOUND
-- Checking for [goto2 - gfortran - pthread]
--   Library goto2: BLAS_goto2_LIBRARY-NOTFOUND
-- Checking for [acml - gfortran]
--   Library acml: BLAS_acml_LIBRARY-NOTFOUND
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Could NOT find Atlas (missing: Atlas_CBLAS_INCLUDE_DIR Atlas_CLAPACK_INCLUDE_DIR Atlas_CBLAS_LIBRARY Atlas_BLAS_LIBRARY Atlas_LAPACK_LIBRARY)
-- Checking for [ptf77blas - atlas - gfortran]
--   Library ptf77blas: BLAS_ptf77blas_LIBRARY-NOTFOUND
-- Checking for []
-- Cannot find a library with BLAS API. Not using BLAS.
-- MKLDNN_CPU_RUNTIME = OMP
-- DNNL_TARGET_ARCH: X64
-- DNNL_LIBRARY_NAME: dnnl
CMake Warning (dev) at ../envs/pytorchenv/Library/share/cmake-3.22/Modules/FindPackageHandleStandardArgs.cmake:438 (message):
The package name passed to `find_package_handle_standard_args` (OpenMP_C)
does not match the name of the calling package (OpenMP).  This can lead to
problems in calling code that expects `find_package` result variables
(e.g., `_FOUND`) to follow a certain pattern.
Call Stack (most recent call first):
cmake/Modules/FindOpenMP.cmake:584 (find_package_handle_standard_args)
third_party/ideep/mkl-dnn/third_party/oneDNN/cmake/OpenMP.cmake:69 (find_package)
third_party/ideep/mkl-dnn/third_party/oneDNN/CMakeLists.txt:117 (include)
This warning is for project developers.  Use -Wno-dev to suppress it.
CMake Warning (dev) at ../envs/pytorchenv/Library/share/cmake-3.22/Modules/FindPackageHandleStandardArgs.cmake:438 (message):
The package name passed to `find_package_handle_standard_args` (OpenMP_CXX)
does not match the name of the calling package (OpenMP).  This can lead to
problems in calling code that expects `find_package` result variables
(e.g., `_FOUND`) to follow a certain pattern.
Call Stack (most recent call first):
cmake/Modules/FindOpenMP.cmake:584 (find_package_handle_standard_args)
third_party/ideep/mkl-dnn/third_party/oneDNN/cmake/OpenMP.cmake:69 (find_package)
third_party/ideep/mkl-dnn/third_party/oneDNN/CMakeLists.txt:117 (include)
This warning is for project developers.  Use -Wno-dev to suppress it.
-- Could NOT find Doxyrest (missing: DOXYREST_EXECUTABLE)
-- Found PythonInterp: ../envs/pytorchenv/python.exe (found suitable version "3.10.9", minimum required is "2.7")
-- Could NOT find Sphinx (missing: SPHINX_EXECUTABLE)
-- Enabled workload: TRAINING
-- Enabled primitives: ALL
-- Enabled primitive CPU ISA: ALL
-- Primitive cache is enabled
-- The ASM_MASM compiler identification is MSVC
-- Found assembler: ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/ml64.exe
-- Found MKL-DNN: TRUE
-- Performing Test C_HAS_THREAD - Success
-- Module support is disabled.
-- Version: 9.1.0
-- Build type: Release
-- CXX_STANDARD: 17
-- Required features: cxx_variadic_templates
-- Using CPU-only version of Kineto
-- Configuring Kineto dependency:
--   KINETO_SOURCE_DIR = ../third_party/kineto/libkineto
--   KINETO_BUILD_TESTS = OFF
--   KINETO_LIBRARY_TYPE = static
-- Found PythonInterp: ../envs/pytorchenv/python.exe (found version "3.10.9")
INFO CUDA_SOURCE_DIR =
INFO ROCM_SOURCE_DIR =
INFO CUPTI unavailable or disabled - not building GPU profilers
-- Kineto: FMT_SOURCE_DIR = ../third_party/fmt
-- Kineto: FMT_INCLUDE_DIR = ../third_party/fmt/include
INFO CUPTI_INCLUDE_DIR = /extras/CUPTI/include
INFO ROCTRACER_INCLUDE_DIR = /include/roctracer
INFO DYNOLOG_INCLUDE_DIR = ../third_party/kineto/libkineto/third_party/dynolog/
INFO IPCFABRIC_INCLUDE_DIR = ../third_party/kineto/libkineto/third_party/dynolog//dynolog/src/ipcfabric/
-- Configured Kineto (CPU)
CMake Warning (dev) at ../envs/pytorchenv/Library/share/cmake-3.22/Modules/CMakeDependentOption.cmake:84 (message):
Policy CMP0127 is not set: cmake_dependent_option() supports full Condition
Syntax.  Run "cmake --help-policy CMP0127" for policy details.  Use the
cmake_policy command to set the policy and suppress this warning.
Call Stack (most recent call first):
CMakeLists.txt:713 (cmake_dependent_option)
This warning is for project developers.  Use -Wno-dev to suppress it.
-- Performing Test HAS/UTF_8 - Success
-- Performing Test HAS/WD4624 - Success
-- Performing Test HAS/WD4068 - Success
-- Performing Test HAS/WD4067 - Success
-- Performing Test HAS/WD4267 - Success
-- Performing Test HAS/WD4661 - Success
-- Performing Test HAS/WD4717 - Success
-- Performing Test HAS/WD4244 - Success
-- Performing Test HAS/WD4804 - Success
-- Performing Test HAS/WD4273 - Success
-- Performing Test HAS_WNO_STRINGOP_OVERFLOW - Failed
-- Looking for backtrace - not found
-- Could NOT find Backtrace (missing: Backtrace_LIBRARY Backtrace_INCLUDE_DIR)
-- don't use NUMA
-- headers outputs:
-- sources outputs:
-- declarations_yaml outputs:
-- Performing Test COMPILER_SUPPORTS_NO_AVX256_SPLIT - Failed
-- Using ATen parallel backend: OMP
AT_INSTALL_INCLUDE_DIR include/ATen/core
core header install: ../build/aten/src/ATen/core/TensorBody.h
core header install: ../build/aten/src/ATen/core/aten_interned_strings.h
core header install: ../build/aten/src/ATen/core/enum_tag.h

netcorefan1 · February 28, 2023, 4:54am

…second and last part of the log:

-- Generating sources for unboxing kernels ../envs/pytorchenv/python.exe;-m;torchgen.gen_executorch;--source-path=../test/edge/../../test/edge;--install-dir=../build/out;--tags-path=../test/edge/../../aten/src/ATen/native/tags.yaml;--aten-yaml-path=../test/edge/../../aten/src/ATen/native/native_functions.yaml;--use-aten-lib;--op-selection-yaml-path=../test/edge/../../test/edge/selected_operators.yaml;--custom-ops-yaml-path=../test/edge/../../test/edge/custom_ops.yaml
CMake Warning (dev) at torch/CMakeLists.txt:387:
Syntax Warning in cmake code at column 107
Argument not separated from preceding token by whitespace.
This warning is for project developers.  Use -Wno-dev to suppress it.
CMake Warning (dev) at torch/CMakeLists.txt:387:
Syntax Warning in cmake code at column 115
Argument not separated from preceding token by whitespace.
This warning is for project developers.  Use -Wno-dev to suppress it.
-- Autodetected CUDA architecture(s):  8.6
-- Using Lib/site-packages as python relative installation path
CMake Warning at CMakeLists.txt:1092 (message):
Generated cmake files are only fully tested if one builds with system glog,
gflags, and protobuf.  Other settings may generate files that are not well
tested.
-- somehow this is happening
--
-- ******** Summary ********
-- General:
--   CMake version         : 3.22.1
--   CMake command         : ../envs/pytorchenv/Library/bin/cmake.exe
--   System                : Windows
--   C++ compiler          : ../Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe
--   C++ compiler id       : MSVC
--   C++ compiler version  : 19.34.31937.0
--   Using ccache if found : OFF
--   CXX flags             : /DWIN32 /D_WINDOWS /GR /EHsc /bigobj /FS -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DLIBKINETO_NOROCTRACER -DUSE_FBGEMM -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE /utf-8 /wd4624 /wd4068 /wd4067 /wd4267 /wd4661 /wd4717 /wd4244 /wd4804 /wd4273
--   Build type            : Release
--   Compile definitions   : ONNX_ML=1;ONNXIFI_ENABLE_EXT=1;ONNX_NAMESPACE=onnx_torch;_CRT_SECURE_NO_DEPRECATE=1;USE_EXTERNAL_MZCRC;MINIZ_DISABLE_ZIP_READER_CRC32_CHECKS;USE_FLASH_ATTENTION;WIN32_LEAN_AND_MEAN;_UCRT_LEGACY_INFINITY;NOMINMAX;BUILD_NVFUSER
--   CMAKE_PREFIX_PATH     : ..\anaconda3\envs\pytorchenv\Lib\site-packages;../NVIDIA GPU Computing Toolkit/CUDA/v12.0
--   CMAKE_INSTALL_PREFIX  : ../torch
--   USE_GOLD_LINKER       : OFF
--
--   TORCH_VERSION         : 2.0.0
--   CAFFE2_VERSION        : 2.0.0
--   BUILD_CAFFE2          : OFF
--   BUILD_CAFFE2_OPS      : OFF
--   BUILD_STATIC_RUNTIME_BENCHMARK: OFF
--   BUILD_TENSOREXPR_BENCHMARK: OFF
--   BUILD_NVFUSER_BENCHMARK: OFF
--   BUILD_BINARY          : OFF
--   BUILD_CUSTOM_PROTOBUF : ON
--     Link local protobuf : ON
--   BUILD_DOCS            : OFF
--   BUILD_PYTHON          : True
--     Python version      : 3.10.9
--     Python executable   : ../envs/pytorchenv/python.exe
--     Pythonlibs version  : 3.10.9
--     Python library      : ../envs/pytorchenv/libs/python310.lib
--     Python includes     : ../envs/pytorchenv/include
--     Python site-packages: Lib/site-packages
--   BUILD_SHARED_LIBS     : ON
--   CAFFE2_USE_MSVC_STATIC_RUNTIME     : OFF
--   BUILD_TEST            : True
--   BUILD_JNI             : OFF
--   BUILD_MOBILE_AUTOGRAD : OFF
--   BUILD_LITE_INTERPRETER: OFF
--   INTERN_BUILD_MOBILE   :
--   TRACING_BASED         : OFF
--   USE_BLAS              : 0
--   USE_LAPACK            : 0
--   USE_ASAN              : OFF
--   USE_TSAN              : OFF
--   USE_CPP_CODE_COVERAGE : OFF
--   USE_CUDA              : ON
--     Split CUDA          :
--     CUDA static link    : OFF
--     USE_CUDNN           : ON
--     USE_EXPERIMENTAL_CUDNN_V8_API: ON
--     CUDA version        : 12.0
--     USE_FLASH_ATTENTION : ON
--     cuDNN version       : 8.8.0
--     CUDA root directory : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0
--     CUDA library        : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/cuda.lib
--     cudart library      : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/cudart_static.lib
--     cublas library      : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/cublas.lib;../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/cublasLt.lib
--     cufft library       : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/cufft.lib
--     curand library      : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/curand.lib
--     cuDNN library       : ../NVIDIA/CUDNN/v8.8/lib/x64/cudnn.lib
--     nvrtc               : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/lib/x64/nvrtc.lib
--     CUDA include path   : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/include
--     NVCC executable     : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/bin/nvcc.exe
--     CUDA compiler       : ../NVIDIA GPU Computing Toolkit/CUDA/v12.0/bin/nvcc.exe
--     CUDA flags          :  -Xcompiler /w -w -Xcompiler /FS -Xfatbin -compress-all -DONNX_NAMESPACE=onnx_torch --use-local-env -gencode arch=compute_86,code=sm_86 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=integer_sign_change,--diag_suppress=useless_using_declaration,--diag_suppress=set_but_not_used,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=implicit_return_from_non_void_function,--diag_suppress=unsigned_compare_with_zero,--diag_suppress=declared_but_not_referenced,--diag_suppress=bad_friend_decl --Werror cross-execution-space-call --no-host-device-move-forward --expt-relaxed-constexpr --expt-extended-lambda  -Xcompiler=/wd4819,/wd4503,/wd4190,/wd4244,/wd4251,/wd4275,/wd4522 -Wno-deprecated-gpu-targets --expt-extended-lambda -DCUB_WRAPPED_NAMESPACE=at_cuda_detail -DCUDA_HAS_FP16=1 -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__
--     CUDA host compiler  :
--     CUDA --device-c     : OFF
--     USE_TENSORRT        : OFF
--   USE_ROCM              : OFF
--   BUILD_NVFUSER         : ON
--   USE_EIGEN_FOR_BLAS    : ON
--   USE_FBGEMM            : ON
--     USE_FAKELOWP          : OFF
--   USE_KINETO            : ON
--   USE_FFMPEG            : OFF
--   USE_GFLAGS            : OFF
--   USE_GLOG              : OFF
--   USE_LEVELDB           : OFF
--   USE_LITE_PROTO        : OFF
--   USE_LMDB              : OFF
--   USE_METAL             : OFF
--   USE_PYTORCH_METAL     : OFF
--   USE_PYTORCH_METAL_EXPORT     : OFF
--   USE_MPS               : OFF
--   USE_FFTW              : OFF
--   USE_MKL               : OFF
--   USE_MKLDNN            : ON
--   USE_MKLDNN_ACL        : OFF
--   USE_MKLDNN_CBLAS      : OFF
--   USE_UCC               : OFF
--   USE_ITT               : ON
--   USE_NCCL              : OFF
--   USE_NNPACK            : OFF
--   USE_NUMPY             : ON
--   USE_OBSERVERS         : ON
--   USE_OPENCL            : OFF
--   USE_OPENCV            : OFF
--   USE_OPENMP            : ON
--   USE_TBB               : OFF
--   USE_VULKAN            : OFF
--   USE_PROF              : OFF
--   USE_QNNPACK           : OFF
--   USE_PYTORCH_QNNPACK   : OFF
--   USE_XNNPACK           : ON
--   USE_REDIS             : OFF
--   USE_ROCKSDB           : OFF
--   USE_ZMQ               : OFF
--   USE_DISTRIBUTED       : ON
--     USE_MPI               : OFF
--     USE_GLOO              : ON
--     USE_GLOO_WITH_OPENSSL : OFF
--     USE_TENSORPIPE        : OFF
--   Public Dependencies  :
--   Private Dependencies : Threads::Threads;pthreadpool;cpuinfo;XNNPACK;fbgemm;ittnotify;fp16;caffe2::openmp;gloo;foxi_loader;fmt::fmt-header-only;kineto
--   USE_COREML_DELEGATE     : OFF
--   BUILD_LAZY_TS_BACKEND   : ON
--   TORCH_DISABLE_GPU_ASSERTS : OFF
-- Configuring done
CMake Error in caffe2/CMakeLists.txt:
Imported target "torch::nvtoolsext" includes non-existent path
"../NVIDIA Corporation/NvToolsExt/include"
in its INTERFACE_INCLUDE_DIRECTORIES.
-- Generating done
CMake Generate step failed.  Build files cannot be regenerated correctly.

I would like to ask if there is a way to get precompiled binaries, because even if I manage to get the whole thing compiled (it’s nearly one day of attempts), the above log shows so many errors and failures not supposed to happens on an RTX3070 card and I am afraid that the resulting build could not work as expected.
Many thanks

ptrblck · February 28, 2023, 6:16am

You could install the pip wheels or conda binaries for the latest stable PyTorch release with CUDA 11.6 or 11.7 or for the nightly release with CUDA 11.7 or 11.8 instead of building PyTorch from source.

netcorefan1 · February 28, 2023, 9:06am

Decision to build from source came from the fact that I wanted to switch from 1.13 to CUDA 12 and take advantages of all the features of my card. If I have understood well, support has been recently implemented and I wanted to take a chance to try.
Even if I will be forced to return back to the old version, I’m afraid that I will have to downgrade system drivers. I remember that I have only been able to make PyTorch working with 11.7 after downgrading drivers to 517.40 (last version to support 11.7). I could not even use 11.8 because PyTorch 3D is tied to PyTorch and it doesn’t support its nightly build.

I could be wrong, but to me seems that Cuda 12 reintroduced this three years old issue, but actually I have no idea on what could be the right workaround. Should I create a new torch.compile Bug Report in the repo?

ptrblck · February 28, 2023, 9:08am

Your 3070 is fully supported in the current builds using CUDA 11.7 and 11.8.

No, you won’t have to downgrade the drivers as they are backwards compatible.

netcorefan1 · February 28, 2023, 11:30am

Isn’t Cuda 12 (and related drivers) considered a major change which breaks backward compatibility with older versions? If you say so, then I must have missed out something in the configurations.
Maximum supported version from PyTorch3D is PyTorch 1.13.0. So, I’m stuck to 11.7 if I want boths. PyTorch3D just refuses any linkage with nightly builds.
Anyway, it’s nice to know that if everything goes wrong I will still be able to use GPU without downgrading drivers, but I still want to build for Cuda 12 which is the main purpose of this topic.

Cuda 12 seems to have broken the binaries and libraries paths. It should be just a matter to provide the corrected path and most the problems, if not all, should be fixed.