Right loss function for a VAE with word2vec (CBOW and skip-gram)

Can anyone suggest a good loss function for training a VAE implemented for CBOW and skip-gram (word2vec), and for analysing its validation performance? Pointers to any online code for word2vec with a VAE would also be much appreciated.

