I wonder if the devs have any specific advice on training with half precision?
I converted my model to run with cuda().half(), but it does not seem to converge.
Is there something I should be aware of?
Thank you!
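For reference, the conversion I mean is roughly the sketch below (the ResNet-18 and the random dummy batch are just placeholders to show the dtype handling, not my actual model or data):

```python
import torch
import torch.nn as nn
import torchvision.models as models

# Cast the whole model to fp16 and run it on the GPU.
model = models.resnet18(num_classes=10).cuda().half()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)
criterion = nn.CrossEntropyLoss()

for step in range(100):
    # Dummy batch for illustration; in practice this comes from a DataLoader.
    images = torch.randn(32, 3, 224, 224).cuda().half()  # inputs must be fp16 too
    targets = torch.randint(0, 10, (32,)).cuda()

    optimizer.zero_grad()
    loss = criterion(model(images), targets)
    loss.backward()
    optimizer.step()
```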
This week Amazon launched AWS P3 instances with Tesla V100 cards, which support half-precision training, so I am reviving this old topic.
I only have experience with full-precision training on a Titan X. If anyone has insight on how to train with half precision on Tesla V100 or P100 cards, please share it with us!
@FuriouslyCurious, did you manage to run anything at all on the V100?
Please see here for tips on training with mixed precision: http://docs.nvidia.com/deeplearning/sdk/mixed-precision-training/index.html
Those tips are very interesting. Has anyone got examples of implementing them in PyTorch? Are there any examples of successfully training in half precision with PyTorch, especially for standard architectures like ResNet?
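In case it helps, here is a rough sketch of what the tips in that NVIDIA doc (fp32 master weights plus loss scaling) could look like in plain PyTorch. The ResNet-18, the dummy data, and the static scale of 128 are just values I picked for illustration, not an official recipe:

```python
import torch
import torch.nn as nn
import torchvision.models as models

loss_scale = 128.0  # static loss scale; the NVIDIA doc discusses how to choose/adjust this

# fp16 model for forward/backward, plus an fp32 "master" copy used for the weight update.
model = models.resnet18(num_classes=10).cuda().half()
master_params = [p.detach().clone().float() for p in model.parameters()]
for p in master_params:
    p.requires_grad = True

optimizer = torch.optim.SGD(master_params, lr=0.1, momentum=0.9)
criterion = nn.CrossEntropyLoss()

for step in range(100):
    # Dummy batch for illustration; in practice this comes from a DataLoader.
    images = torch.randn(32, 3, 224, 224).cuda().half()
    targets = torch.randint(0, 10, (32,)).cuda()

    loss = criterion(model(images), targets)
    model.zero_grad()
    (loss * loss_scale).backward()  # scale the loss so small fp16 gradients don't underflow

    # Copy the scaled fp16 gradients into the fp32 master params and undo the scaling.
    for master, p in zip(master_params, model.parameters()):
        if p.grad is not None:
            master.grad = p.grad.detach().float() / loss_scale

    optimizer.step()  # update happens in fp32

    # Copy the updated fp32 master weights back into the fp16 model.
    with torch.no_grad():
        for master, p in zip(master_params, model.parameters()):
            p.copy_(master)
```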