|
About the autograd category
|
|
0
|
4055
|
May 13, 2017
|
|
Continuous CPU RAM growth during LSTM training with cuDNN enabled (PyTorch 2.7.1 / CUDA 12.8 / cuDNN 9.10.0.2)
|
|
0
|
25
|
June 2, 2026
|
|
How does autograd deal with minibatches?
|
|
1
|
38
|
May 29, 2026
|
|
Using Jax code with PyTorch code
|
|
16
|
2275
|
May 26, 2026
|
|
Truncating part of a computational graph
|
|
0
|
35
|
May 19, 2026
|
|
Building a PyTorch curriculum that teaches math through code, not as a prerequisite
|
|
0
|
44
|
May 19, 2026
|
|
Plotting the gradient computation graph including values
|
|
0
|
24
|
April 15, 2026
|
|
Segmentation fault when calling .backward() after moving data to GPU (PyTorch + CUDA 12.1)
|
|
8
|
349
|
April 14, 2026
|
|
How to get the version numbers of a Module's Parameters?
|
|
5
|
905
|
March 13, 2026
|
|
Requires_grad becomes false after some operation
|
|
3
|
82
|
March 3, 2026
|
|
Problem of freeze metrics after first epoch
|
|
1
|
54
|
February 28, 2026
|
|
Function 'Scaled Dot Product Efficient Attention Backward0' returned nan values in its 0th output
|
|
13
|
2187
|
February 9, 2026
|
|
RNN memory management: RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation
|
|
2
|
63
|
February 6, 2026
|
|
About the sign of gradients from token probability w.r.t. intermediate activations during inference
|
|
1
|
51
|
January 12, 2026
|
|
Using `forward_pre_hook` to attribute CUDA OOMs to module execution context
|
|
2
|
68
|
January 8, 2026
|
|
PyTorch CPU RAM Usage Grows Rapidly When Assembling Forces from CNN Output—How to Prevent Memory Leak?
|
|
0
|
64
|
December 11, 2025
|
|
Simple extension of autograd saved tensor hook mechanism
|
|
4
|
82
|
December 10, 2025
|
|
Does scaled_dot_product_attention's backward support reproduce
|
|
0
|
53
|
December 10, 2025
|
|
How to make a manually changed loss work in backpropagation
|
|
1
|
45
|
December 3, 2025
|
|
torch.autograd.Function and free function
|
|
2
|
61
|
December 3, 2025
|
|
Autograd and dead-code elimination
|
|
2
|
117
|
November 18, 2025
|
|
Does PyTorch muon optimizer supports 4D weights?
|
|
1
|
148
|
November 17, 2025
|
|
Gradient ascent on some parameters while descent on others in a single model
|
|
3
|
93
|
November 11, 2025
|
|
Batchnorm and back-propagation
|
|
8
|
4123
|
November 3, 2025
|
|
How to debug origin of nans in gradient of custom module
|
|
3
|
100
|
October 29, 2025
|
|
Optimizing a mask instead of weights
|
|
2
|
103
|
October 27, 2025
|
|
PyTorch AD with non-python functions
|
|
1
|
59
|
October 22, 2025
|
|
Brenier maps in Pytorch
|
|
1
|
92
|
October 18, 2025
|
|
Updating Selected parameters in each epoch
|
|
1
|
57
|
October 17, 2025
|
|
No grad & Autocast not working together
|
|
1
|
74
|
October 14, 2025
|