Training with Half Precision

ptrblck · September 20, 2020, 11:08pm

I would generally recommend to use the automatic mixed precision package (via torch.cuda.amp), which uses casts the input to the appropriate dtype for each method.