Similar to fp16 inference in pytorch framework like [Training With Mixed Precision :: NVIDIA Deep Learning Performance Documentation](Training With Mixed Precision :: NVIDIA Deep Learning Performance Documentation), is there any framework about fp8 inference in pytorch?
Thank you for your time.