| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the autograd category | 0 | 4025 | May 13, 2017 |
| About the sign of gradients from token probability w.r.t. intermediate activations during inference | 0 | 3 | January 11, 2026 |
| Using `forward_pre_hook` to attribute CUDA OOMs to module execution context | 2 | 32 | January 8, 2026 |
| PyTorch CPU RAM Usage Grows Rapidly When Assembling Forces from CNN Output—How to Prevent Memory Leak? | 0 | 29 | December 11, 2025 |
| Simple extension of autograd saved tensor hook mechanism | 4 | 33 | December 10, 2025 |
| Does scaled_dot_product_attention's backward support reproduce | 0 | 21 | December 10, 2025 |
| How to make a manually changed loss work in backpropagation | 1 | 37 | December 3, 2025 |
| torch.autograd.Function and free function | 2 | 47 | December 3, 2025 |
| Autograd and dead-code elimination | 2 | 80 | November 18, 2025 |
| Does PyTorch muon optimizer supports 4D weights? | 1 | 59 | November 17, 2025 |
| Gradient ascent on some parameters while descent on others in a single model | 3 | 71 | November 11, 2025 |
| Batchnorm and back-propagation | 8 | 4037 | November 3, 2025 |
| How to debug origin of nans in gradient of custom module | 3 | 63 | October 29, 2025 |
| Optimizing a mask instead of weights | 2 | 61 | October 27, 2025 |
| PyTorch AD with non-python functions | 1 | 42 | October 22, 2025 |
| Brenier maps in Pytorch | 1 | 51 | October 18, 2025 |
| Updating Selected parameters in each epoch | 1 | 35 | October 17, 2025 |
| No grad & Autocast not working together | 1 | 42 | October 14, 2025 |
| How to do back propagation with loss = \|\|f_{\Theta + \Delta P}(X) - Y\|\|^2 + \|\|\Delta P\|\|^2 instead of the usual loss = \|\| f_{\Theta + \Delta}(X) - Y \|\|^2? | 2 | 38 | October 14, 2025 |
| "RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.FloatTensor [64, 1]], which is output 0 of AsStridedBackward0, is at version 3; expected version 2 instead. Hint: the backtrace further a | 10 | 33183 | October 9, 2025 |
| Defining loss that maximizes separation | 0 | 39 | October 1, 2025 |
| Ablation Hook I created seems to affect the calculations of the gradients on later layers | 5 | 73 | September 30, 2025 |
| How do I set the order of pytorch hooks? | 5 | 78 | September 29, 2025 |
| Updating tensors that are used in backpropagation but are not network parameters | 2 | 79 | September 20, 2025 |
| Making autograd saved tensors hooks specific to certain arguments | 8 | 273 | September 17, 2025 |
| what happens when I use torch.profiler.profile with activities=[torch.profiler.ProfilerActivity.CPU, torch.profiler.ProfilerActivity.CUDA] | 1 | 515 | September 16, 2025 |
| Autograd engine thread creation | 2 | 73 | September 8, 2025 |
| Solving PDEs using neural networks | 1 | 2462 | September 5, 2025 |
| Hessian Vector Product for discounted-return-based loss function | 2 | 73 | September 2, 2025 |
| How does torch.where work in autograd | 2 | 68 | September 2, 2025 |