For example, a simple jit’ed MSE:
import torch

@torch.jit.script
def jit_mse(input, target):
    return ((input - target) ** 2).mean()
shows the CUDA kernel fused_sub_pow in the profiler; however, the original graph (with aten::sub and aten::pow) is left unchanged.
This seems to make the graph property a bit useless, or is this not its intended usage?
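A minimal sketch for reproducing the comparison, assuming that torch.jit.last_executed_optimized_graph() (a debugging helper, not a documented stable API as far as I can tell) returns the graph the executor actually ran. The tensor sizes and the number of warm-up calls are arbitrary choices, and whether a fusion group appears will depend on the PyTorch version, the device, and the fuser in use:

```python
import torch

@torch.jit.script
def jit_mse(input, target):
    return ((input - target) ** 2).mean()

# Fall back to CPU so the script still runs without a GPU;
# the fusion described above was observed on CUDA.
device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.randn(1024, device=device)
y = torch.randn(1024, device=device)

# Call the function a few times so the executor has a chance
# to profile and optimize it (3 is an arbitrary choice).
for _ in range(3):
    loss = jit_mse(x, y)

# The graph as scripted, before any runtime optimization.
original_graph = str(jit_mse.graph)

# Assumption: this helper exposes the graph used for the most recent call.
optimized_graph = str(torch.jit.last_executed_optimized_graph())

print("scripted graph:\n", original_graph)
print("last executed graph:\n", optimized_graph)
```

If the two printouts differ, that would suggest the graph property only reflects the scripted form, with fusion applied to a separate copy at execution time.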