Filter all silent segments (front, middle, and back) from a complete audio clip using T.vad()

lllxw · September 17, 2021, 5:22am

I have an audio file that contains too many silent segments, and I want to filter them out with torchaudio.functional.vad(). But the function only trims the silent part at the front of the audio; silent segments still remain in the middle and at the back. Can someone tell me why? Thanks!

Besides, I want to know the meaning of each parameter in vad(). I ask because I am trying to split the audio into many shorter segments (e.g. 0.03 s each), process each one with vad, and finally concatenate the processed segments, but I don’t know how to set the parameters.
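
For reference, here is a minimal sketch of the frame-by-frame idea described above: cut the signal into 0.03 s frames, drop the quiet ones, and concatenate the rest. It is only an illustration under stated assumptions. It uses a plain RMS energy gate in pure Python, not the algorithm behind torchaudio.functional.vad(), and the names `drop_silent_frames`, `frame_seconds`, and `threshold_db` are hypothetical, as is the -40 dB default. Samples are assumed to be floats in [-1, 1].

```python
import math

def drop_silent_frames(samples, sample_rate, frame_seconds=0.03, threshold_db=-40.0):
    """Keep only frames whose RMS level (dB relative to 1.0) exceeds threshold_db.

    Hypothetical helper: a simple energy gate, not torchaudio's vad.
    """
    frame_len = max(1, round(sample_rate * frame_seconds))
    kept = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        # Root-mean-square amplitude of this frame
        rms = math.sqrt(sum(x * x for x in frame) / len(frame))
        level_db = 20.0 * math.log10(rms) if rms > 0 else float("-inf")
        if level_db > threshold_db:
            kept.extend(frame)  # concatenate the non-silent frames
    return kept

# Synthetic signal: silence, tone, silence, tone, silence (0.3 s each at 16 kHz)
sr = 16000
tone = [0.5 * math.sin(2 * math.pi * 440 * n / sr) for n in range(4800)]
silence = [0.0] * 4800
audio = silence + tone + silence + tone + silence

voiced = drop_silent_frames(audio, sr)
print(len(audio), len(voiced))
```

With a real recording, the threshold would probably need tuning to the noise floor, and very short frames can chop off quiet word onsets, so some padding around kept frames may be worth adding.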

Thanks a lot!