Label smoothing with CTCLoss

  1. Your ctc_loss looks straightforward, so there shouldn’t be any issues.
  2. In kl_tar:

why are you using kl_inp (which uses input log_probs) for creating the kl_tar? Should it be dependent on the targets input instead?