MSELoss and torch.max

On the other hand, judging by this post, it seems hard to propagate the gradient back through torch.max.
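For reference, here is a minimal sketch of what autograd does in a small case. It assumes the max is taken along a dimension and that the returned values (not the indices) are passed to nn.MSELoss. The tensors `scores` and `target` are made-up toy data, not taken from the post.

```python
import torch
import torch.nn as nn

# Hypothetical toy data: 2 rows of 3 scores each, plus one target per row.
scores = torch.tensor([[1.0, 3.0, 2.0],
                       [4.0, 0.5, 1.0]], requires_grad=True)
target = torch.tensor([2.0, 5.0])

# With a dim argument, torch.max returns a (values, indices) pair.
values, indices = torch.max(scores, dim=1)

# MSE between the per-row maxima and the targets.
loss = nn.MSELoss()(values, target)
loss.backward()

# Gradient with respect to the input scores.
grad = scores.grad

print(values.requires_grad)   # the values stay attached to the graph
print(indices.requires_grad)  # the integer indices carry no gradient
print(grad)                   # nonzero only where each row's max was taken
```

In this toy case the gradient does flow back, but only to the entry that produced each row's maximum; every other entry gets zero. The indices are integers and are not differentiable, so a loss built on them (an argmax) would not train. That may be the difficulty the post refers to, though it depends on which output was being used there.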