The loss of Wasserstein GANs is typically unconstrainted:
But I have seen examples of some WGAN metric that is steadily decreasing, for example this
from the original Wasserstein GAN paper. But how can this “Wasserstein metric” easily be calculated?
Generally, the discriminator loss is just D(x) - D(G(Z)) and the generator loss is D(G(Z)) but these produce losses akin to the top graph above. Admittedly I lack the math skills of understanding the original WGAN paper.