| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the torch.compile category | 0 | 1339 | January 9, 2023 |
| Variable sized outputs in custom operator? | 1 | 15 | April 15, 2026 |
| JITed nn.TransformerDecoderLayer runs significantly slower than in eager mode | 0 | 26 | March 20, 2026 |
| Inductor codegen debugging | 1 | 181 | March 18, 2026 |
| Mark_dynamic does not work when the dimension is 1 | 1 | 59 | March 18, 2026 |
| Flex_attention: BlockMask conversion? | 1 | 59 | February 25, 2026 |
| Nested torch.compile-d function calls with different options / CUDA graph options | 0 | 33 | February 16, 2026 |
| Flex_attention and torch.compile | 1 | 72 | February 12, 2026 |
| ONNX Community Survey 2026 | 0 | 37 | February 11, 2026 |
| "Offset increment outside graph capture encountered unexpectedly" is a specific PyTorch Windows issue that's known to occur with CUDA Graphs. This is a bug in PyTorch Windows builds | 0 | 103 | February 10, 2026 |
| torch.compile(fullgraph=True) fails on nn.utils.rnn.pad_sequence with padding_side, while fullgraph=False succeeds | 0 | 23 | February 5, 2026 |
| Error: accessing tensor output of CUDAGraphs that has been overwritten by a subsequent run | 1 | 781 | February 5, 2026 |
| Investigating torch custom ops using torch.compile | 2 | 60 | February 1, 2026 |
| Optimizing a TTS Model | 0 | 45 | January 30, 2026 |
| Torch.compile on train_step incl both fwd and bwd | 1 | 44 | January 21, 2026 |
| AOTInductor on Windows | 0 | 80 | January 19, 2026 |
| Torch.compile Diagnostic Dashboard | 1 | 51 | January 18, 2026 |
| Compiling a method other than forward | 3 | 360 | January 14, 2026 |
| Torch.compile - getting `nans` with the latest version of torch | 3 | 216 | January 6, 2026 |
| Torch.compile results in loss nan while it converges without torch.compile on mac cpu and mps | 0 | 28 | January 6, 2026 |
| How to verify that default_decompositions successfully reduce operators to the Core ATen IR set? | 0 | 20 | December 30, 2025 |
| NaNs with torch.compile (inductor, mode="reduce-overhead"), OOM with backend="cudagraphs" | 0 | 56 | December 23, 2025 |
| How to save torch.compile so we don't need to re-compile | 2 | 434 | December 23, 2025 |
| How to load state_dict if saved as torch.save(model.state_dict(), fs) | 12 | 182 | December 21, 2025 |
| Issue when using aot_module with torch.autograd | 0 | 30 | December 19, 2025 |
| Let HOF (torch.cond, torch.while_loop) skip compilation of input functions if needed | 0 | 26 | December 17, 2025 |
| Autotune cache key for matrix multiplication | 0 | 25 | December 11, 2025 |
| Torch compile cache sharing across different GPUs | 3 | 208 | December 10, 2025 |
| Coordinate_descent_tuning errors out with torch.AcceleratorError: CUDA error: invalid argument | 12 | 446 | December 5, 2025 |
| _GLIBCXX_USE_CXX11_ABI incompatibility | 2 | 81 | December 3, 2025 |