Latest mixed-precision topics

Topic	Replies	Views	Activity
Torch.cuda.amp inferencing slower than normal	6	3204	September 12, 2022
Can autocast context manager be used around all of training loop?	13	2032	August 26, 2022
Cuda.amp slower than TF32 on NVIDIA A100?	7	1639	August 24, 2022
Loss of result precision from function convereted from numpy/TFv1 to PyTorch	12	1191	August 21, 2022
Autocast not casting tensors to float16	2	1090	August 17, 2022
Loss of result precision from function convereted from numpy to torch	0	593	August 15, 2022
Segmentation fault when running IPEX bf16 example with torch.autocast	2	771	August 11, 2022
RuntimeError: expected scalar type Half but found Float from fc layers in TorchScript	2	2514	August 3, 2022
Performance (Training Speed) of Autocast Bfloat16	3	1651	August 3, 2022
Handling GPU/CPU compute differences	1	1553	July 27, 2022
Bfloat16 training question	4	921	July 18, 2022
Mixed Precision Training on CUDA with bfloat16	2	3476	July 9, 2022
Is it OK to disable `amp` of BN by decorating its forward function?	0	526	July 5, 2022
Huggingface microsoft/mdeberta model never gets updated under AMP	2	1305	June 29, 2022
Got nan in forward with `torch.amp`	2	1278	June 29, 2022
AMP not casting custom Parameter tensor	1	638	June 28, 2022
Would the eps of 1e-8 in AdamW be rounded to zero when open `torch.amp`?	1	536	June 26, 2022
Would `torch.amp` cause a slower convergence?	4	833	June 24, 2022
Training with custom, quantized datatype	4	1504	June 5, 2022
Half precision Convolution cause NaN in forward pass	5	2640	May 26, 2022
Is there a way to force some functions to be run with FP32 precision?	2	1999	April 30, 2022
AMP for DCGAN training	4	588	April 29, 2022
Mixed precision model using more memory in inference(Didn't compare in finetuning)	2	627	April 29, 2022
Matrix Exponential FP16 Support? Fixed order approxmation?	0	444	April 28, 2022
Mixed precision and r1 regularization	0	1481	April 21, 2022
Onnx mixed precision slow	1	649	April 21, 2022
KL divergence negative with AMP	3	749	April 19, 2022
Torch.cuda.amp.autocast breaks simplex constraint	2	1040	April 18, 2022
[CTC Loss] CTC Loss not support float16?	2	1044	April 17, 2022
Exporting batchnorm layer to onnx with autocast	2	1496	April 7, 2022