CTC loss vs NLL/CrossEntropy performance differences

Hi all,

I am having trouble finding research that directly compares models trained with CrossEntropy (NLL) loss against models trained with CTC loss and reports how their performance differs. It seems that for a seq2seq task it makes a lot of sense to use CTC, but I would like to see how large a difference I can expect. Any pointers are welcome.
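
To make the question concrete, here is a minimal pure-Python sketch of how I understand the two losses. It is a toy illustration only, not taken from any paper or library: the function names, the `BLANK = 0` convention, and the probabilities are all assumptions. The point is that frame-level cross-entropy needs one aligned target per time step, while CTC sums over every alignment that collapses to the target sequence.

```python
import math

BLANK = 0  # assumption: index 0 is the CTC blank symbol


def cross_entropy(probs, frame_targets):
    """Frame-level NLL: needs exactly one aligned target label per time step."""
    assert len(probs) == len(frame_targets)
    return -sum(math.log(p[t]) for p, t in zip(probs, frame_targets))


def ctc_loss(probs, labels):
    """CTC NLL: sums the probability of every alignment that collapses to `labels`."""
    # Interleave blanks: [a, b] -> [blank, a, blank, b, blank]
    ext = [BLANK]
    for label in labels:
        ext += [label, BLANK]
    size = len(ext)

    # alpha[s]: total probability of all partial alignments ending in ext[s]
    alpha = [0.0] * size
    alpha[0] = probs[0][ext[0]]
    if size > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, len(probs)):
        new = [0.0] * size
        for s in range(size):
            total = alpha[s]                      # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]             # advance by one symbol
            # Skipping a blank is allowed only between two different labels
            if s >= 2 and ext[s] != BLANK and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new[s] = total * probs[t][ext[s]]
        alpha = new

    # Valid alignments end on the last label or on the trailing blank
    likelihood = alpha[-1] + (alpha[-2] if size > 1 else 0.0)
    return -math.log(likelihood)


# Two time steps, vocabulary {blank, "a"}, uniform (made-up) probabilities
probs = [[0.5, 0.5], [0.5, 0.5]]
ce = cross_entropy(probs, [1, 1])  # must commit to one alignment: "a", "a"
ctc = ctc_loss(probs, [1])         # counts (a, blank), (blank, a) and (a, a)
```

In this toy case the CTC loss is lower simply because three alignments contribute to the likelihood instead of one. That says nothing about downstream accuracy, which is the comparison I am hoping someone has seen studied.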