| Topic | Replies | Views | Activity |
|---|---|---|---|
| About the Memory Format category | 0 | 1627 | March 18, 2020 |
| Sparse bmm cause CUDA misaligned address error | 2 | 88 | October 23, 2025 |
| Free all GPU memory used in between runs | 4 | 28684 | August 18, 2025 |
| Output mismatch with channels_last_3d? | 1 | 73 | July 9, 2025 |
| Question: What takes so much gpu memory? | 2 | 108 | July 6, 2025 |
| Understand the memory allocation visualization | 1 | 120 | July 1, 2025 |
| Why the reserved memory is much larger than occupied memory? | 1 | 104 | June 25, 2025 |
| Why does GPU memory usage not double when loading two identical models in PyTorch? | 2 | 119 | June 9, 2025 |
| Questions about GPU memory usage | 2 | 372 | May 8, 2025 |
| Out of Memory During LoRA Fine-Tuning on LLAMA-4-Scout-17B with H100 (80GB VRAM) | 0 | 223 | May 4, 2025 |
| Method for efficiently transferring non-autograd tensors to CPU from GPU? | 4 | 150 | April 9, 2025 |
| Clip Grad Norm on GPU without sync | 1 | 118 | March 30, 2025 |
| Redefining model with fewer parameters → out of memory (OOM)? | 1 | 81 | February 19, 2025 |
| DtoH transfer of a partial tensor or untyped_storage to tensor without memory allocation | 0 | 70 | February 19, 2025 |
| How can I decrease the pytorch confidence to hold too much reserved memory | 3 | 430 | January 8, 2025 |
| `torch.cuda.is_available()` allocates unwanted memory? | 2 | 586 | December 11, 2024 |
| Guarantee traversal order for optimiser states | 0 | 136 | October 12, 2024 |
| How to share data among DataLoader processes to save memory | 6 | 14520 | October 10, 2024 |
| Libtorch CPP Api for Memory Format Channels Last | 1 | 112 | September 7, 2024 |
| Replacing torch.zeros internals with cudaMemset instead of fill kernel | 2 | 351 | September 5, 2024 |
| Fancy indexing memory footprint | 0 | 168 | August 19, 2024 |
| Unable to free all GPU memory | 3 | 548 | August 12, 2024 |
| Why does tf.tile not make use of strided layout? And what about "inverse" strides? | 0 | 130 | August 3, 2024 |
| Advanced Slicing | 3 | 290 | July 28, 2024 |
| Why aren't inputs to conv1d channels last? | 3 | 2317 | July 22, 2024 |
| Frombuffer() → "The given buffer is not writable" | 1 | 1743 | June 19, 2024 |
| TensorDataset with lazy loading? | 4 | 1761 | June 7, 2024 |
| Understanding GPU memory visualization result | 1 | 243 | May 11, 2024 |
| Do operations between tensors and scalars move the tensor to CPU? | 5 | 1307 | April 20, 2024 |
| Understanding error msg "view size is not compatible with input tensor's size and stride" | 5 | 3808 | April 19, 2024 |