PyTORCH and NPU

Hello all,

I have a question regarding the inner-working of PyTorch and trying to understand how can PyTorch leverage from NPU and basically interface custom NPU for PyTorch?
In other words how to access the computing capability of NPU to Pytorch to have benefited from?
Is there any interface in PyTorch to be developed on custom NPU and provide compute performance?

Thank you.