c10::NotImplementedError with minimal example

I cannot reproduce the issue using libtorch==1.10.2_cu113 on Linux and get:

 1  0  0
 0  1  0
 0  0  1
[ CUDAFloatType{3,3} ]

as the output.
Maybe you are hitting this issue on Windows.