@tom
Sorry for the late response, it works! Thanks!
Another question, regarding parallelization of CTC loss:
I'm wondering how to parallelize the loss computation, given that log_probs
is expected in (T, N, C) layout while the other inputs (targets, input_lengths, target_lengths) are batch-first. Since DataParallel
scatters all inputs along one common dimension, I currently don't see how to do this.
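
To make the shape mismatch concrete, here is a minimal sketch of the only workaround I can think of: a wrapper module that takes log_probs batch-first as (N, T, C) and transposes inside forward, so every input shares dim 0. The name `BatchFirstCTCLoss` and the sizes are made up for illustration, and the DataParallel line is left as a comment because it is an assumption, not something verified on multiple GPUs here.

```python
import torch
import torch.nn as nn


class BatchFirstCTCLoss(nn.Module):
    """Hypothetical wrapper: accepts log_probs as (N, T, C) so all inputs share batch dim 0."""

    def __init__(self, blank=0):
        super().__init__()
        # reduction="none" yields one loss per sample, so per-device results
        # can simply be concatenated along dim 0 when gathered.
        self.ctc = nn.CTCLoss(blank=blank, reduction="none")

    def forward(self, log_probs, targets, input_lengths, target_lengths):
        # (N, T, C) -> (T, N, C), the layout nn.CTCLoss expects
        return self.ctc(log_probs.transpose(0, 1), targets, input_lengths, target_lengths)


# Illustrative sizes (assumptions): batch, time steps, classes, target length
N, T, C, S = 4, 12, 6, 5
torch.manual_seed(0)
log_probs = torch.randn(N, T, C).log_softmax(dim=2)        # batch-first
targets = torch.randint(1, C, (N, S), dtype=torch.long)    # 0 is the blank label
input_lengths = torch.full((N,), T, dtype=torch.long)
target_lengths = torch.full((N,), S, dtype=torch.long)

loss_fn = BatchFirstCTCLoss()
# With several GPUs one could then try (assumption, untested here):
# loss_fn = nn.DataParallel(loss_fn).cuda()

per_sample = loss_fn(log_probs, targets, input_lengths, target_lengths)  # shape (N,)
# Plain mean over samples; note nn.CTCLoss's own "mean" reduction also
# divides each loss by its target length first.
loss = per_sample.mean()
```

Would something like this be the intended way, or is there a cleaner option?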