PyTorch Forums

Topic	Replies	Views	Activity
Reducing PyTorch C++ API Library Size for Inference-Only Use C++	1	69	May 26, 2025
When variables are transferred between GPUs, their values change distributed	6	67	June 10, 2025
Implementing custom .to function C++	2	199	December 21, 2024
How to fix Index put requires the source and destination dtypes match` with `google/gemma-2-2b` in Transformers?	0	158	February 13, 2025
Libtorch low latency problem C++	3	185	February 10, 2025
Optimizing the data augmentation process vision	3	170	January 24, 2025
Broken torchtext conda package?	3	174	September 20, 2024
FSDP hybrid sharding on multiple nodes distributed	1	132	March 31, 2025
How to use torch.exp() with flatten()	4	210	September 9, 2024
Torch.linalg.lstsq: (Batch element 0): Argument 6 has illegal value	5	70	February 19, 2025
Parameter loading	5	228	November 1, 2024
CUDA Out of memory with large dataset, but OK with same batch_size and context_length for small test set	5	67	October 16, 2024
Question about object localization vision	5	286	August 24, 2024
Inference with float16	2	130	February 18, 2025
Torch.load and mmap data	1	153	March 22, 2025
Flex attention raises unnecessary assertion error torch._inductor	1	126	March 12, 2025
How to calculate the activation memory usage of a model	1	109	January 16, 2025
Building pytorch from source with docker image doesnt include mpi distributed	1	198	January 10, 2025
Softmax Output Values Do Not Sum to 1 in PyTorch	1	133	September 11, 2024
Avoid cpu from idle while load data data	6	281	August 13, 2024
Autograd FLOP Calculation with Higher Order Derivatives autograd	3	173	May 9, 2025
The strangest bug in model.forward() for multiple batches C++	3	83	December 8, 2024
Reproduce qconv kernel for x86 quantization	4	177	August 19, 2024
DDP Training Hangs after completing Epoch distributed	2	99	March 21, 2025
Backward pass using distributed tensors distributed	2	253	September 16, 2024
Torch.argmin() non-differentiability workaround autograd	2	222	September 14, 2024
Reproducing results across different machines (CPU) - Dropout layers vision	2	204	August 20, 2024
Should forward() be banned from ModuleList with FSDP? distributed	2	103	July 31, 2024
"turing_fp16_s1688gemm_fp16_128x128_ldg8_relu_f2f_tn"	5	107	April 21, 2025
Teacher forcing ratio nlp	0	236	November 8, 2024
Strange Unet artifact	3	208	October 12, 2024
Cannot run Demand forecasting with the Temporal Fusion Transformer	0	166	August 22, 2024
PyTorch Profiler self cpu time vs cpu time	1	166	December 3, 2024
Can't run torch conv2d with AMD Instinct MI250X projects	1	119	March 3, 2025
Export Detectron2 model before calling model.eval() vision	1	213	August 2, 2024
Log file generation	2	226	January 2, 2025
Autograd graph traversal autograd	2	228	December 18, 2024
KeyError:'Keypoints' in Pytorch-How to resolve this? vision	2	158	November 25, 2024
Issues with Training UNet Architecture Model vision	2	90	October 22, 2024
LBFGS in multi gpu machine distributed	2	235	October 7, 2024
GPU Utilization Tutorial/Troubleshooting	2	231	August 17, 2024
RNNCell uses much more memory than RNN	2	94	July 19, 2024
Autograd and Temporary Variables autograd	4	122	May 12, 2025
Does PyTorch `.to(device)` propagate gradients back to original device?	4	126	March 12, 2025
BCEWithLogitsLoss with weights causes runtimeError vision	4	163	September 16, 2024
How does Forward-mode AD work behind the scenes in Pytorch? autograd	1	118	December 16, 2024
AssertionError: Dict types must use ConstDictVariable torch._dynamo	1	203	September 9, 2024
Pytorch with MPI backend distributed-rpc	1	178	August 12, 2024
Training hangs on loss.backward() with DDP --nnodes=2 --nproc_per_node=3 distributed	3	103	May 22, 2025
Correct Implementation of Beta-VAE Reconstruction Loss with ViT Encoder-Decoder Architecture	0	202	December 11, 2024