What is different between FusionGroup and CudaFusionGroup?

alanzhai219 · May 19, 2021, 2:50am

Different API will result in different group such as FusionGroup and CudaFusionGroup. Why? And what is different?

tom · May 19, 2021, 7:45am

These are actually different fuser generations.
The “classic” 1st-gen fuser only did pointwise ops and created FusionGroup nodes. The newer fuser developed by a team at NVIDIA creates CUDAFusionGroup. To round off the trio, there is TensorExprGroup nodes created by the TensorExpr/NNC fuser developed by a team at FB. The latter two also support some reductions.
A while ago, I wrote a blog on the various fusers.

Best regards

Thomas

alanzhai219 · May 19, 2021, 8:16am

Thanks for your reply. I read your blog first. It seems that the mechanism is not easy to figure out.

tom · May 19, 2021, 8:36am

The JIT optimization steps probably are among the most sophisticated bits in PyTorch (along with the dispatcher,…). For a deep dive on one of the fusers, I can also enthusiastically recommend Christian Sarofeen’s talk (I think you need to register to see it).

Best regards

Thomas

alanzhai219 · May 19, 2021, 11:39am

Excellent! I will look into it and will email you when encounter any question. Much thanks.