The c++ extension tutorial uses torch::PackedTensorAccessor32
and states
This is important as using the 64-bit variant (
PackedAccessor64
) can make the kernel slower
But the github code it links to uses torch::PackedTensorAccessor
. The code is supposed to be the same in both. Which one is correct?
Update: based on compile warnings it looks like the tutorial is correct and the github code is out of date.