I just used new
true_divide method on pytorch 1.6.0.
It possibly has a bug which took me quite a lot of time to trace back.
torch.tensor().true_divide(15).ceil().long() gives 29 (correct result), while its cuda version
torch.tensor().to("cuda:1").true_divide(15).ceil().long() gives 30.
It might be something related to precision of math operators on cuda.
Do you guys know how to fix it?