Hi,
Where can I find the source code of matrix-matrix multiply kernel that PyTorch uses?
Thank you!
In the new thread I’m chasing things through the PyTorch code for bmm: