Yes, a matmul
approach will be used for the native implementations as given here. Different backends (e.g. cuDNN) could call different algorithms internally, which depend on the workload shape etc.
1 Like