About the torch.compile category
|
|
0
|
1244
|
January 9, 2023
|
Choice of torch.compile vs. triton
|
|
7
|
7647
|
August 29, 2025
|
Reducing compiled call overhead
|
|
0
|
5
|
August 29, 2025
|
Dynamo logs generated by torch.distributed
|
|
2
|
30
|
August 27, 2025
|
PyTorch Error: Compiler: cl is not found
|
|
3
|
884
|
August 27, 2025
|
How can we reliably figured out what the parameters of a generated PTX kernel are with torch.compile()?
|
|
0
|
17
|
August 25, 2025
|
Module 'torch.library' has no attribute 'custom_op'
|
|
1
|
206
|
August 23, 2025
|
C++/Cuda / aot.compile and cuda graph
|
|
2
|
537
|
August 23, 2025
|
Training with flex attention is extremely slow due to torch.compile settings
|
|
0
|
21
|
August 22, 2025
|
Is it possible that node's name can change during torch.compile?
|
|
0
|
10
|
August 22, 2025
|
Torch.compile Numpy code throws mean() arguments error
|
|
0
|
8
|
August 21, 2025
|
Using RNG Generator in torch.compile
|
|
2
|
28
|
August 20, 2025
|
How to turn off inlining / force materialization in TorchInductor during torch.compile?
|
|
0
|
13
|
August 16, 2025
|
Some questions about torch.compile
|
|
0
|
18
|
August 15, 2025
|
`torch.compile` (w/ Torch Inductor) benchmarks/models for Multi GPU
|
|
0
|
26
|
August 7, 2025
|
CUDA Graph Error with Residual Connections in `torch.compile` (RuntimeError: accessing tensor output of CUDAGraphs)
|
|
0
|
33
|
August 1, 2025
|
The CUDA kernel produces different results when running in CUDA Graph mode compared to non-CUDA Graph mode
|
|
0
|
16
|
July 30, 2025
|
Skip dynamo when using 'cudagraph' backend
|
|
0
|
31
|
July 28, 2025
|
Getting the fx graph of submodules, instead of 'call_module' nodes?
|
|
1
|
731
|
July 24, 2025
|
Using megacache vs saving /tmp/torchinductor_root
|
|
0
|
59
|
July 21, 2025
|
Is there any way to save multiple graphs from same model without redundantly saving their state dicts?
|
|
2
|
50
|
July 19, 2025
|
_inductor/schduler.py line#3161
|
|
0
|
19
|
July 6, 2025
|
Why pytorch use triton as backend of inductor?
|
|
2
|
110
|
July 3, 2025
|
Disabling guards generation using dynamo based export
|
|
1
|
46
|
June 30, 2025
|
Memory Footprint of Eager Mode vs Torch Compiled (Default, with CUDA Graph, and CUDA Graph Dynamic)
|
|
4
|
1709
|
June 27, 2025
|
Torch.compile _softmax in MultiheadAttention does not return same value as eager
|
|
4
|
53
|
June 26, 2025
|
Graph exported by torch.onnx.dynamo_export
|
|
2
|
503
|
June 24, 2025
|
Issues with torch.compile in DistributedModelParallel (DMP)
|
|
1
|
29
|
June 19, 2025
|
Issues with PyTorch while_loop operator when exporting the torch model to ExecuTorch
|
|
1
|
50
|
June 19, 2025
|
Building from GIT error: PyTorch needs CUDNN-8.5 or above but found 8.2.4
|
|
4
|
72
|
June 17, 2025
|