Latest mixed-precision topics

Topic	Replies	Views	Activity
Gradient with Automatic Mixed Precision	2	689	November 23, 2023
Changing dtype drastically affects training time	1	436	November 15, 2023
AMP on cpu: No Gradscaler necessary / available?	1	1102	November 14, 2023
Subnormal FP16 values detected when converting to TRT	4	3616	November 6, 2023
Does torch.cuda.amp support O2 almost FP16 training now?	1	701	November 2, 2023
Why would GradientScaler work	3	466	October 28, 2023
Training loss behaves strangely in mixed-precision training	5	773	October 20, 2023
Gradients'dtype is not fp16 when using torch.cuda.amp	3	577	October 20, 2023
Model distillation with mixed-precision training	4	531	October 9, 2023
Unexpected execution time difference for identical operations on GPU	8	597	September 25, 2023
Performance regression in torch 2.0 with deterministic algorithms	2	738	September 22, 2023
Is autocast expected to reflect changes to weights?	1	518	September 20, 2023
How to handle the value outside the fp16 range when casting?	6	1857	September 11, 2023
Gradients type in torch.cuda.amp	3	825	August 22, 2023
Torch autocast's gradient	3	712	August 21, 2023
Scaler.step(optimizer) in FP16 or FP32?	1	781	August 2, 2023
Amp on cpu 50x slower and high memory allocation	0	542	August 1, 2023
Why the loss_scale getting smaller and smaller?	1	614	July 17, 2023
Dataset half precision	1	555	July 11, 2023
Cudnn.allow_tf32 makes my network slower	5	780	July 4, 2023
What is the correct way to use mixed-precision training with OneCycleLR	3	759	June 20, 2023
How to replace apex.amp by pytorch amp?	1	2476	June 14, 2023
Fused mixed precision updates with PyTorch amp	4	1330	June 7, 2023
Fp16 matmul - CUDA kernel output differs from torch	2	1065	May 31, 2023
Jetson Nano AMP varied inference time	0	742	May 22, 2023
Gradient Accumulation failing	2	801	May 17, 2023
Why to keep parameters in float32, why not in (b)float16?	4	5132	May 15, 2023
Crash in BCEWithLogitsLoss	7	1216	April 26, 2023
Autocast with batch normalization in Pytorch model.eval() returns NaNs	1	1103	April 26, 2023
Can't run inference on FP16 trained model	4	1577	April 25, 2023