MFCC features for word boundary detection

How can the MFCC features extracted from a speech signal be used to perform word/sentence boundary detection with pytorch?
Also can Connectionist Temporal classification cost be used to achieve the same??