VAE KL divergence

I used to think that the KL term gets smaller as a VAE is trained, but my training run suggests otherwise. Can someone explain why the KL term is increasing instead of decreasing? Is this expected, or am I doing something wrong?

blue = total loss
green = BCE loss
red = KL

[Image: plot of the three training curves listed above]
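
For reference, a minimal sketch of how the two plotted terms are typically defined, assuming a diagonal Gaussian posterior and a standard normal prior (neither is confirmed by the plot alone). The function names `kl_diag_gaussian` and `bce` and the example values are illustrative, not taken from the actual training code. It shows that the KL term is zero only when the posterior equals the prior, and becomes positive as soon as the encoder's outputs move away from it.

```python
import math

def kl_diag_gaussian(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ), summed over latent dimensions."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv) for m, lv in zip(mu, logvar))

def bce(x, x_hat, eps=1e-7):
    """Binary cross-entropy between targets x and predictions x_hat, summed over elements."""
    total = 0.0
    for t, p in zip(x, x_hat):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= t * math.log(p) + (1.0 - t) * math.log(1.0 - p)
    return total

# Hypothetical encoder outputs: posterior identical to the prior.
kl_prior = kl_diag_gaussian([0.0, 0.0], [0.0, 0.0])

# Hypothetical encoder outputs: posterior shifted and narrowed, so it carries information.
kl_informative = kl_diag_gaussian([1.5, -2.0], [-1.0, -0.5])

# Total loss as plotted: reconstruction term plus KL term (hypothetical inputs).
total = bce([1.0, 0.0], [0.9, 0.1]) + kl_informative

print(kl_prior, kl_informative, total)
```

Under these assumptions the total loss can keep falling while the KL term rises, as long as the reconstruction term drops faster than the KL term grows.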