New device backend, trying to implement all operators

Hi, we are trying to implement all native operators to make good use of our GPU device (not from NVIDIA).

Could anyone please tell us what operators we should implement?

I’ve checked the native_functions.yaml file, but still not pretty sure where to begin. Are those operators that are with a CUDA dispatch key the only operators that we should implement?