Numerically stable log-softplus

At some point during computation, my model needs to compute the logarithm of the softplus of a parameter.
Currently, I implement this via:

torch.nn.functional.softplus(theta).log()

Because of the log() call, I fear there might be numerical-stability issues.
Is there a numerically more stable way of computing log-softplus?
I do not want to change my parameterization.
Thank you!

I don’t think this can be simplified. As for “stability”, you’re limited by float32 representation bounds; it’s not as though you’ll get biased errors or anything (beyond the inherent imprecision of log()).

quick update:
for now, I am using the approximation shown in this pseudo-code:

def log_softplus(x):
  # pseudo-code (elementwise): use the exact expression for moderate x,
  # and the asymptotic approximation log(softplus(x)) ~= x for very negative x
  if x > -5:
    return torch.nn.functional.softplus(x).log()
  else:
    return x

which is numerically stable for very negative x. If done naively (softplus followed by log), softplus would return a very small number, with a corresponding loss of precision.
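
For concreteness, a tiny float32 demonstration of the failure mode the breakpoint is meant to avoid might look like this:

import torch

t = torch.tensor([-5.0, -30.0, -110.0])
# for very negative x, float32 softplus(x) ~= exp(x) eventually underflows to zero ...
print(torch.nn.functional.softplus(t))        # the last entry underflows to 0. in float32
print(torch.nn.functional.softplus(t).log())  # ... so the subsequent log() returns -inf there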

Ah, sorry, you’re right. I somehow assumed that lim(log(x)) = 0 and that there was no problem in that area. You may just want to vectorize your approach with torch.where.
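
For instance, a vectorized version might look like this (just a sketch, keeping the breakpoint from your pseudo-code):

import torch

def log_softplus(x, brk=-5.0):
    # elementwise: exact expression above the breakpoint,
    # asymptotic approximation log(softplus(x)) ~= x below it
    return torch.where(x > brk, torch.nn.functional.softplus(x).log(), x)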

Hi Steve!

I think your concern about loss of precision is a partial red herring.

Because we’re working with floating-point numbers, we don’t lose
precision when softplus (x) becomes small (say, on the order of
10**-7). We only start to lose precision when softplus (x) becomes
very small and starts to denormalize and then underflows to zero
(at which point the subsequent log() will return -inf). This starts to
happen around 10**-38 (which corresponds to an x of about -90.0).
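
(These thresholds are easy to check, for example:)

import torch

print (torch.finfo (torch.float32).tiny)   # ~1.18e-38, the smallest normal float32
print (torch.tensor (-90.0).exp())         # ~8.2e-40 -- softplus (x) ~= exp (x) here, already denormal
print (torch.tensor (-110.0).exp())        # underflows to 0., so a subsequent log() would give -inf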

Whether this matters depends on your specific use case. But, unless
your argument to log-softplus can reasonably become as negative as,
say, -90, your “naive” implementation will be fine.

On the other hand, if your argument to log-softplus can reasonably
become that negative, then your suggested version does make
sense, except that you should use a “breakpoint” (x > -5) that is
significantly more negative than -5 (perhaps something like -40).
As it stands, your breakpoint of -5 does more harm than good,
because in this range x is an imperfect approximation to log-softplus.

The following illustrative script uses a double-precision log-softplus
computation as a surrogate for the exact result:

import torch
print (torch.__version__)

def log_softplusA (x):
    return torch.nn.functional.softplus (x).log()

def log_softplusB (x, brk = -5.0):
    return torch.where (x > brk, torch.nn.functional.softplus (x).log(), x)

x = torch.arange (-100, 101, 1).double() / 10.0

lsp = log_softplusA (x)   # double-precision "ground truth"

diffA = log_softplusA (x.float()) - lsp   # float32 naive version vs. ground truth
diffB = log_softplusB (x.float()) - lsp   # float32 breakpoint version vs. ground truth

print ('diffA.abs().max() =', diffA.abs().max())
print ('diffB.abs().max() =', diffB.abs().max())
print ('diffB.abs().argmax() =', diffB.abs().argmax())
print ('x[diffB.abs().argmax()] =', x[diffB.abs().argmax()])

print ('log_softplusA (torch.tensor ([-90.0])) =', log_softplusA (torch.tensor ([-90.0])))
print ('log_softplusA (torch.tensor ([-100.0])) =', log_softplusA (torch.tensor ([-100.0])))
print ('log_softplusA (torch.tensor ([-110.0])) =', log_softplusA (torch.tensor ([-110.0])))
print ('log_softplusA (torch.tensor ([-110.0], dtype = torch.double)) =', log_softplusA (torch.tensor ([-110.0], dtype = torch.double)))

And here is its result:

1.7.1
diffA.abs().max() = tensor(8.1795e-07, dtype=torch.float64)
diffB.abs().max() = tensor(0.0034, dtype=torch.float64)
diffB.abs().argmax() = tensor(50)
x[diffB.abs().argmax()] = tensor(-5., dtype=torch.float64)
log_softplusA (torch.tensor ([-90.0])) = tensor([-90.])
log_softplusA (torch.tensor ([-100.0])) = tensor([-99.9831])
log_softplusA (torch.tensor ([-110.0])) = tensor([-inf])
log_softplusA (torch.tensor ([-110.0], dtype = torch.double)) = tensor([-110.], dtype=torch.float64)

Best.

K. Frank

Well, thank you for your insightful comment!

I think you are right: for precise computation of log-softplus, a much lower (more negative) breakpoint is sufficient.

What I did not mention (my bad) is that I need to backpropagate gradients through this computation.
This changes things a lot: if the breakpoint is set too low, the error in the gradient can be several orders of magnitude bigger than the error in the log-softplus output (below, up to about 0.13 in the gradient versus roughly 8e-07 in the output itself).
The following code is an extension of yours.

import torch
print (torch.__version__)

def log_softplusA (x):
    return torch.nn.functional.softplus (x).log()

def log_softplusB (x, brk = -15.0):
    return torch.where (x > brk, torch.nn.functional.softplus (x).log(), x)

x = torch.arange (-200, 100, 1).double() / 10.0
x.requires_grad = True
lsp = log_softplusA (x)   # double-precision "ground truth"
lsp.sum().backward()
lspg = x.grad             # corresponding "ground truth" gradient

xA = torch.arange (-200, 100, 1) / 10.0   # float32 input for the naive version (A)
xA.requires_grad = True
lspA = log_softplusA (xA)
diffA = lspA - lsp
lspA.sum().backward()
lspgA = xA.grad
diffgA = lspgA - lspg

xB = torch.arange (-200, 100, 1) / 10.0   # float32 input for the breakpoint version (B)
xB.requires_grad = True
lspB = log_softplusB (xB)
diffB = lspB - lsp
lspB.sum().backward()

lspgB = xB.grad
diffgB = lspgB - lspg


print ('diffA.abs().max() =', diffA.abs().max())
print ('diffB.abs().max() =', diffB.abs().max())
print ('diffB.abs().argmax() =', diffB.abs().argmax())
print ('x[diffB.abs().argmax()] =', x[diffB.abs().argmax()])

print ('diffgA.abs().max() =', diffgA.abs().max())
print ('diffgB.abs().max() =', diffgB.abs().max())
print ('diffgB.abs().argmax() =', diffgB.abs().argmax())
print ('x[diffgB.abs().argmax()] =', x[diffgB.abs().argmax()])
print ('lspgB[diffgB.abs().argmax()], lspg[diffgB.abs().argmax()] =', lspgB[diffgB.abs().argmax()], lspg[diffgB.abs().argmax()])

yields

1.8.1
diffA.abs().max() = tensor(8.4096e-07, dtype=torch.float64, grad_fn=<MaxBackward1>)
diffB.abs().max() = tensor(8.4096e-07, dtype=torch.float64, grad_fn=<MaxBackward1>)
diffB.abs().argmax() = tensor(61)
x[diffB.abs().argmax()] = tensor(-13.9000, dtype=torch.float64, grad_fn=<SelectBackward>)
diffgA.abs().max() = tensor(1.0000, dtype=torch.float64)
diffgB.abs().max() = tensor(0.1339, dtype=torch.float64)
diffgB.abs().argmax() = tensor(53)
x[diffgB.abs().argmax()] = tensor(-14.7000, dtype=torch.float64, grad_fn=<SelectBackward>)
lspgB[diffgB.abs().argmax()], lspg[diffgB.abs().argmax()] = tensor(0.8661) tensor(1.0000, dtype=torch.float64)

After varying the breakpoint and re-running the code, I empirically found that a value of -8.0 performs best.
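
For reference, a sweep along these lines (just a rough sketch reusing the definitions above; the exact numbers will depend on the PyTorch version) is one way to compare candidate breakpoints:

import torch

def log_softplusA (x):
    return torch.nn.functional.softplus (x).log()

def log_softplusB (x, brk):
    return torch.where (x > brk, torch.nn.functional.softplus (x).log(), x)

# double-precision "ground truth" values and gradients
x = torch.arange (-200, 100, 1).double() / 10.0
x.requires_grad = True
lsp = log_softplusA (x)
lsp.sum().backward()
lspg = x.grad

# float32 comparison for a few candidate breakpoints
for brk in [-40.0, -15.0, -10.0, -8.0, -5.0]:
    xB = torch.arange (-200, 100, 1) / 10.0
    xB.requires_grad = True
    lspB = log_softplusB (xB, brk)
    lspB.sum().backward()
    err_out = (lspB - lsp).abs().max().item()
    err_grad = (xB.grad - lspg).abs().max().item()
    print ('brk =', brk, ' max output error =', err_out, ' max gradient error =', err_grad)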

Hi Steve!

This may be related to some weirdness in pytorch’s built-in
torch.nn.functional.softplus(). Here is the relevant thread:

Best.

K. Frank

Hi Steve!

I haven’t worked through this in detail, but a first look suggests that
the softplus() fix in the recent nightlies addresses your issue.

Running the script you posted on today’s nightly, 1.9.0.dev20210504,
seems to show that the issue is gone:

1.9.0.dev20210504
diffA.abs().max() = tensor(8.4096e-07, dtype=torch.float64, grad_fn=<MaxBackward1>)
diffB.abs().max() = tensor(8.4096e-07, dtype=torch.float64, grad_fn=<MaxBackward1>)
diffB.abs().argmax() = tensor(61)
x[diffB.abs().argmax()] = tensor(-13.9000, dtype=torch.float64, grad_fn=<SelectBackward>)
diffgA.abs().max() = tensor(1.6522e-07, dtype=torch.float64)
diffgB.abs().max() = tensor(1.6522e-07, dtype=torch.float64)
diffgB.abs().argmax() = tensor(162)
x[diffgB.abs().argmax()] = tensor(-3.8000, dtype=torch.float64, grad_fn=<SelectBackward>)
lspgB[diffgB.abs().argmax()], lspg[diffgB.abs().argmax()] = tensor(0.9890) tensor(0.9890, dtype=torch.float64)

Best.

K. Frank
