About the torch.compile category | 0 | 1207 | January 9, 2023
Torch.compile usage for generative models | 5 | 36 | May 28, 2025
How to get positional order of inputs and outputs for a graph that was exported via torch.export? | 1 | 106 | May 26, 2025
PyTorch Error: Compiler: cl is not found | 2 | 118 | May 21, 2025
How to customize leaf node in dynamo trace? | 2 | 564 | May 21, 2025
Torch.compile and Parameter replacement in a module | 0 | 10 | May 20, 2025
How to free backward input args when I use iree as the compilation backend? | 0 | 12 | May 20, 2025
In inductor, can we unify the way to generate code between using extern_kernels and torch.ops? | 1 | 78 | May 19, 2025
[PyTorch 2.7.0] Custom privateuse1 FakeTensor add + float scalar triggers AttributeError: 'float' object has no attribute 'item_memo' | 0 | 26 | May 19, 2025
Torch not compiled with CUDA enabled for ComfyUI | 1 | 118 | May 13, 2025
Torch compile: optimizer.step Generates Excessive Warning Messages | 10 | 311 | May 8, 2025
Where the grads from backward graph will be copied to param.grad in cuda graphs | 0 | 37 | May 7, 2025
Recompilations leading to timeout | 0 | 48 | May 1, 2025
torch._dynamo.exc.Unsupported: call_method NNModuleVariable() _sa_block [TensorVariable(), LazyVariableTracker(), LazyVariableTracker()] {} | 1 | 466 | April 29, 2025
How to use triton kernels with pytorch? | 1 | 80 | April 29, 2025
What is ComputedBuffer? | 1 | 267 | April 24, 2025
Compiled matmul is slower than vanilla matmul | 1 | 78 | April 24, 2025
FP8 `torch.empty` doesn't work under `inductor` of pytorch 2.4.1 | 2 | 69 | April 24, 2025
Device "meta" and device "cuda:0" error | 0 | 62 | April 23, 2025
Resetting cache in benchmark | 4 | 1587 | April 23, 2025
Torch.compile cpp backend not fusing OuterLoopFusedSchedulerNode properly | 2 | 67 | April 22, 2025
Higher Peak Memory on First Run with torch.compile and Inductor Backend | 0 | 20 | April 21, 2025
How to prevent dispatching `torch._C._nn.upsample_bilinear2d` to vectorize implementation `torch.ops.aten.upsample_bilinear2d.vec` | 0 | 16 | April 21, 2025
Why adaptive_avg_pool1d became adaptive_avg_pool2d in make_fx? | 1 | 55 | April 21, 2025
C++/Cuda / aot.compile and cuda graph | 1 | 499 | April 18, 2025
How does torch dynamo set custom eval frame callback? | 1 | 23 | April 16, 2025
TorchRuntimeError: Failed running call_module L__self___position_embedding_table(*(FakeTensor(..., device='mps', ...), **{}): Unhandled FakeTensor Device Propagation for aten.index_select.default, found two different devices mps:0, mps | 0 | 47 | April 14, 2025
Torch.compile emits double backward error even when the corresponding function is disabled | 5 | 69 | April 14, 2025
Is the use of CUDA Graph Python API discouraged? | 2 | 64 | April 13, 2025
Inductor codegen debugging | 0 | 57 | April 12, 2025