| Topic | Replies | Views | Activity |
| --- | ---: | ---: | --- |
| Mixed precision training with transformer embeddings stored in fp16 | 0 | 122 | June 13, 2024 |
| Autocast keep cache across multiple forward pass | 0 | 124 | June 5, 2024 |
| Precision 16 run problem | 2 | 204 | June 4, 2024 |
| Torch.save numerical differences | 6 | 1710 | May 31, 2024 |
| AMP during inference | 1 | 404 | May 31, 2024 |
| GradScaler for CPU with AMP | 8 | 1167 | May 28, 2024 |
| Alternative to torch.inverse for 16 bit | 2 | 1100 | May 6, 2024 |
| Current CUDA Device does not support bfloat16. Please switch dtype to float16 | 1 | 2128 | April 26, 2024 |
| Cuda half2 support | 0 | 147 | April 25, 2024 |
| How much does TORCH.AMP improve performance | 1 | 275 | April 22, 2024 |
| Why bfloat16 matmul is significantly slower than float32? | 0 | 356 | April 16, 2024 |
| No gradient received in mixed precision training | 2 | 466 | April 12, 2024 |
| What's the use of `scaled_grad_params` in this example of gradient penalty with scaled gradients? | 4 | 208 | April 9, 2024 |
| Bfloat16 from float16 issues | 0 | 422 | April 1, 2024 |
| FP8 support on H100 | 8 | 4497 | March 8, 2024 |
| Converting float16 tensor to numpy causes rounding | 2 | 735 | February 26, 2024 |
| Is Autocast Failing to Cast Gradients? | 1 | 321 | February 19, 2024 |
| When should you *not* use custom_{fwd/bwd}? | 0 | 245 | February 16, 2024 |
| Casting Inputs Using custom_fwd Disables Gradient Tracking | 2 | 397 | February 8, 2024 |
| Wrong Tensor type when using Flash Attention 1.0.9 | 0 | 295 | February 1, 2024 |
| Autocast with BCELoss() on CPU | 2 | 631 | January 18, 2024 |
| Torch.nan not supported in int16 | 1 | 416 | January 9, 2024 |
| How to use float16 for all tensor operations? | 4 | 1559 | January 1, 2024 |
| How to switch mixed-precision mode in training | 2 | 434 | December 26, 2023 |
| Gradient with Automatic Mixed Precision | 2 | 553 | November 23, 2023 |
| Changing dtype drastically affects training time | 1 | 426 | November 15, 2023 |
| AMP on cpu: No Gradscaler necessary / available? | 1 | 1056 | November 14, 2023 |
| Subnormal FP16 values detected when converting to TRT | 4 | 3458 | November 6, 2023 |
| Does torch.cuda.amp support O2 almost FP16 training now? | 1 | 653 | November 2, 2023 |
| Why would GradientScaler work | 3 | 429 | October 28, 2023 |
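Most of the threads above revolve around PyTorch's `torch.amp` stack, i.e. `torch.autocast` for the forward pass and a `GradScaler` for fp16 gradients. As shared context for the discussions listed, here is a minimal sketch of the canonical mixed-precision training step on CUDA; the toy model, synthetic data, and hyperparameters are hypothetical stand-ins, not taken from any particular thread.

```python
import torch
import torch.nn as nn

# Hypothetical toy model and optimizer, purely for illustration.
model = nn.Linear(128, 10).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

# On recent PyTorch versions this is spelled torch.amp.GradScaler("cuda").
scaler = torch.cuda.amp.GradScaler()

for step in range(10):
    inputs = torch.randn(32, 128, device="cuda")
    targets = torch.randint(0, 10, (32,), device="cuda")

    optimizer.zero_grad(set_to_none=True)

    # autocast runs each op in fp16 or fp32 according to its own cast policy.
    with torch.autocast(device_type="cuda", dtype=torch.float16):
        outputs = model(inputs)
        loss = loss_fn(outputs, targets)

    # Scale the loss so small fp16 gradients don't underflow to zero;
    # step() unscales before the optimizer update, update() adapts the scale.
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
```

Several of the threads (e.g. the CPU-side GradScaler and bfloat16 questions) are variations on this pattern: with `bfloat16` or on CPU autocast, the scaler is typically unnecessary because the wider exponent range avoids gradient underflow.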