| Topic | Replies | Views | Activity |
| --- | --- | --- | --- |
| About the torch.compile category | 0 | 1185 | January 9, 2023 |
| torch._dynamo.exc.Unsupported: call_method NNModuleVariable() _sa_block [TensorVariable(), LazyVariableTracker(), LazyVariableTracker()] {} | 1 | 429 | April 29, 2025 |
| How to use triton kernels with pytorch? | 1 | 18 | April 29, 2025 |
| What is ComputedBuffer? | 1 | 199 | April 24, 2025 |
| Compiled matmul is slower than vanilla matmul | 1 | 21 | April 24, 2025 |
| FP8 `torch.empty` doesn't work under `inductor` of pytorch 2.4.1 | 2 | 15 | April 24, 2025 |
| Device "meta" and device "cuda:0" error | 0 | 15 | April 23, 2025 |
| Resetting cache in benchmark | 4 | 1420 | April 23, 2025 |
| Torch.compile cpp backend not fusing OuterLoopFusedSchedulerNode properly | 2 | 12 | April 22, 2025 |
| Higher Peak Memory on First Run with torch.compile and Inductor Backend | 0 | 14 | April 21, 2025 |
| How to prevent dispatching `torch._C._nn.upsample_bilinear2d` to vectorize implementation `torch.ops.aten.upsample_bilinear2d.vec` | 0 | 11 | April 21, 2025 |
| Why adaptive_avg_pool1d became adaptive_avg_pool2d in make_fx? | 1 | 7 | April 21, 2025 |
| C++/Cuda / aot.compile and cuda graph | 1 | 470 | April 18, 2025 |
| How does torch dynamo set custom eval frame callback? | 1 | 15 | April 16, 2025 |
| TorchRuntimeError: Failed running call_module L__self___position_embedding_table(*(FakeTensor(..., device='mps', ...), **{}): Unhandled FakeTensor Device Propagation for aten.index_select.default, found two different devices mps:0, mps | 0 | 11 | April 14, 2025 |
| Torch.compile emits double backward error even when the corresponding function is disabled | 5 | 41 | April 14, 2025 |
| Is the use of CUDA Graph Python API discouraged? | 2 | 32 | April 13, 2025 |
| Inductor codegen debugging | 0 | 16 | April 12, 2025 |
| When will re-autotune mm kernel | 0 | 25 | April 9, 2025 |
| Compiling variable-size cross entropy | 0 | 17 | April 7, 2025 |
| How does pytorch triton work with dynamic shapes? | 0 | 33 | March 31, 2025 |
| Torch.compile() causes RuntimeError: element 0 of tensors does not require | 1 | 18 | March 31, 2025 |
| Error: accessing tensor output of CUDAGraphs that has been overwritten by a subsequent run | 0 | 94 | March 30, 2025 |
| Need help debugging torch.compile issue | 0 | 41 | March 26, 2025 |
| Dynamic slicing | 5 | 72 | March 24, 2025 |
| Best way to detect and fix graphs breaks | 5 | 113 | March 24, 2025 |
| Is it possible for torch SDPA to be slower than manual attention? | 5 | 50 | March 24, 2025 |
| Torch.compile dynamic shapes in non-functional style | 1 | 23 | March 23, 2025 |
| SYCL: feature test compile failed! | 0 | 32 | March 23, 2025 |
| torch.cuda.is_available()=False;torch._C._cuda_getDeviceCount() > 0 | 0 | 63 | March 20, 2025 |