| Topic | Replies | Views | Last activity |
| --- | --- | --- | --- |
| About the mixed-precision category | 0 | 1424 | August 24, 2020 |
| Current CUDA Device does not support bfloat16. Please switch dtype to float16 | 1 | 71 | April 26, 2024 |
| Cuda half2 support | 0 | 36 | April 25, 2024 |
| How much does TORCH.AMP improve performance | 1 | 59 | April 22, 2024 |
| Why bfloat16 matmul is significantly slower than float32? | 0 | 57 | April 16, 2024 |
| No gradient received in mixed precision training | 2 | 54 | April 12, 2024 |
| AMP during inference | 0 | 42 | April 11, 2024 |
| TF32 flags when using AMP | 3 | 49 | April 9, 2024 |
| What's the use of `scaled_grad_params` in this example of gradient penalty with scaled gradients? | 4 | 92 | April 9, 2024 |
| Bfloat16 from float16 issues | 0 | 80 | April 1, 2024 |
| FP8 support on H100 | 8 | 2035 | March 8, 2024 |
| Converting float16 tensor to numpy causes rounding | 2 | 231 | February 26, 2024 |
| Is Autocast Failing to Cast Gradients? | 1 | 144 | February 19, 2024 |
| When should you *not* use custom_{fwd/bwd}? | 0 | 119 | February 16, 2024 |
| Casting Inputs Using custom_fwd Disables Gradient Tracking | 2 | 161 | February 8, 2024 |
| Wrong Tensor type when using Flash Attention 1.0.9 | 0 | 147 | February 1, 2024 |
| Autocast on cpu dramatically slow | 3 | 221 | January 29, 2024 |
| Autocast with BCELoss() on CPU | 2 | 263 | January 18, 2024 |
| Torch.nan not supported in int16 | 1 | 213 | January 9, 2024 |
| How to use float16 for all tensor operations? | 4 | 542 | January 1, 2024 |
| How to switch mixed-precision mode in training | 2 | 279 | December 26, 2023 |
| Gradient with Automatic Mixed Precision | 2 | 291 | November 23, 2023 |
| Changing dtype drastically affects training time | 1 | 318 | November 15, 2023 |
| AMP on cpu: No Gradscaler necessary / available? | 1 | 588 | November 14, 2023 |
| Subnormal FP16 values detected when converting to TRT | 4 | 2895 | November 6, 2023 |
| Does torch.cuda.amp support O2 almost FP16 training now? | 1 | 409 | November 2, 2023 |
| Why would GradientScaler work | 3 | 288 | October 28, 2023 |
| Training loss behaves strangely in mixed-precision training | 5 | 451 | October 20, 2023 |
| Gradients' dtype is not fp16 when using torch.cuda.amp | 3 | 449 | October 20, 2023 |
| Model distillation with mixed-precision training | 4 | 391 | October 9, 2023 |