Now torch has supported fp8 tensor, and fp8 nn.Linear.
Any plans to support fp8 nn.Conv2d and other regular modules ?
If not, could anybody provide some clue about where I should start to custom the nn.Conv2d for fp8 ? Should I start from cudnn c++ coding ? I’ve no idea at all, thanks.