|
About the C++ category
|
|
1
|
4046
|
August 23, 2024
|
|
[RFC] Dispatch Stack Migration for Batch Norm
|
|
0
|
11
|
March 16, 2026
|
|
How does memory allocation work precisely in PyTorch?
|
|
2
|
85
|
March 3, 2026
|
|
At / tensor indexing helper for stable API
|
|
0
|
27
|
February 5, 2026
|
|
Implementing/Optimizing custom scatter_reduce op with 'memorized' indices
|
|
0
|
33
|
February 5, 2026
|
|
Dictionary Input/Output in C++
|
|
2
|
46
|
February 4, 2026
|
|
Custom ops library with new type of neuron for PyTorch
|
|
0
|
31
|
January 22, 2026
|
|
Fast symmetric matrix per vector multiplication
|
|
2
|
35
|
January 21, 2026
|
|
I built a Candy AI clone, but the AI characters don’t maintain consistent personality — how do you fix this?
|
|
0
|
92
|
January 14, 2026
|
|
Super slow training in M5 Mac compared to 4050 Nvidia GPU, why?
|
|
3
|
386
|
January 12, 2026
|
|
The multiple registration of TORCH_LIBRARY triton in PrivateUse1
|
|
0
|
66
|
January 9, 2026
|
|
Model cannot be loaded with torch::jit::load
|
|
5
|
5755
|
January 8, 2026
|
|
Long build time for custom c++/cuda extension
|
|
1
|
120
|
January 7, 2026
|
|
Building from source: nvcc fatal : Unsupported gpu architecture 'compute_120'
|
|
4
|
2667
|
December 25, 2025
|
|
LibTorch C++ support for Intel iGPU (Integrated Graphics) on Windows
|
|
0
|
39
|
December 18, 2025
|
|
Random Memory Spikes (1300MiB) in LibTorch C++ Sequential Inference (Base: 300MiB)
|
|
0
|
27
|
December 15, 2025
|
|
Print statement changes behaviour of kernel calculation(upsample code ran on cpu)
|
|
0
|
27
|
December 10, 2025
|
|
In resolving DispatchKeySet, is PythonDispatch called first or last?
|
|
1
|
61
|
November 14, 2025
|
|
Not able to include cusolverDn.h
|
|
16
|
6931
|
November 10, 2025
|
|
Libtorch multi GPU training
|
|
2
|
172
|
November 6, 2025
|
|
MSE loss - shape result info
|
|
0
|
45
|
November 3, 2025
|
|
LibTorch & Abseil LOG() conflict
|
|
1
|
54
|
October 30, 2025
|
|
No x86_64-linux-gnu-g++ version bounds defined for CUDA version
|
|
0
|
156
|
October 29, 2025
|
|
[CUDA/MSVC][Suggestion] ROI Pool half-precision build error due to ambiguous comparison
|
|
2
|
58
|
October 20, 2025
|
|
NumWorkers > 0 crashes
|
|
3
|
73
|
October 20, 2025
|
|
[libTorch] model initialization on multiple devices for parallel inference
|
|
2
|
185
|
October 19, 2025
|
|
RuntimeError: miopenStatusUnknownError
|
|
1
|
108
|
October 18, 2025
|
|
AOT-inductor run_impl uses a process-wide allocator: can two AOTIModelPackageLoader workers in the same process break each other’s CUDA Graphs?
|
|
0
|
58
|
October 15, 2025
|
|
Segfault Using Kineto With Libtorch
|
|
3
|
498
|
October 14, 2025
|
|
How to prevent compiling?
|
|
0
|
54
|
October 14, 2025
|