Hi,
I’m currently using both the CUDA Graph API and torch.compile() in my source.
I’m aware that torch.compile also provides an option to automatically use CUDA graph based on this post.
Question: It is fair to say that the manual construction of CUDA Graph using the aforementioned API is discouraged then *based on this post? I couldn’t find any real world example (other than the ones in CUDA Graph API) I find it quite challenging to debug as the only debug option is to use CUDAGraph.debug_dump() and view the graph.