PyTorch Forums

Topic	Replies	Views	Activity
How to add L2 Regularization to Parameter groups	2	25	April 25, 2025
Can Anyone help me with this!(Weighted Loss)	1	15	April 25, 2025
VAEGAN - Multiple losses and multiple networks training	6	54	April 25, 2025
Handling a priori on covariate variables for RNN data	0	7	April 25, 2025
Advice for training deeper networks on very small datasets vision	3	40	April 25, 2025
Training Loss and Accuracy both increasing while training DP-SGD Opacus	0	7	April 25, 2025
Does pytorch supports fp16xfp16->fp32 matmul?	0	7	April 25, 2025
What is ComputedBuffer? torch._inductor	1	203	April 24, 2025
Compiled matmul is slower than vanilla matmul torch.compile	1	23	April 24, 2025
FP8 `torch.empty` doesn't work under `inductor` of pytorch 2.4.1 torch.compile	2	17	April 24, 2025
Handle important gaps in TimeSeriesDataSet data	0	10	April 24, 2025
FSDP all-gather during backward pass distributed	4	1237	April 24, 2025
Switch loss function causes "RuntimeError: Found dtype Double but expected Float" quantization	6	1702	April 24, 2025
Torch ppc64le wheels	1	72	April 24, 2025
Torchtune for base model	0	6	April 24, 2025
Distributed Collectives distributed	5	32	April 23, 2025
Matmul slows down when doing communication overlapping distributed	1	19	April 23, 2025
Introduction to Libuv TCPStore Backend distributed	1	27	April 23, 2025
I want to collbrate with you vision	2	39	April 23, 2025
Can't run forward pass of WaveRNN model due to unsuccessful GPU RAM allocation audio	0	29	April 23, 2025
Multiple forwards and comp graph building autograd	2	12	April 23, 2025
Torch.fx symbolic_trace failing with TypeError on torch.ones: “slice indices must be integers or None”	0	7	April 23, 2025
Device "meta" and device "cuda:0" error torch.compile	0	20	April 23, 2025
How to fix “CUDA error: device-side assert triggered” error?	16	77305	April 23, 2025
Is there an out-of-place equivalent to boolean indexing assignment?	2	43	April 23, 2025
My Discriminator model collapsed and always returns 1s autograd	0	9	April 23, 2025
Discrepancy Between Theoretical and Measured FLOPs/token for LLaMA-4 Scout 17B (MoE)	0	10	April 23, 2025
Resetting cache in benchmark torch._dynamo	4	1435	April 23, 2025
Incredibly long torch loading	7	3862	April 23, 2025
Increase computational cost while repeating use the trained CNN models	6	36	April 22, 2025