Is there a way to compute Kaldi pitch features with gradients?

I want to compute Kaldi pitch features at the end of a network. Is there a way to get gradients through the Kaldi pitch feature extraction, perhaps something similar to the spectrogram transform in torchaudio?



I’m not familiar with Kaldi pitch. How do you compute it? If you build it from PyTorch operations, autograd should work just fine :smiley:
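
To illustrate what "built from PyTorch operations" means here, below is a rough sketch of a pitch estimate that autograd can differentiate. It is not Kaldi's pitch algorithm: it swaps in a plain autocorrelation with a soft-argmax over candidate lags. The function name `soft_pitch` and its parameters (`f_min`, `f_max`, `temperature`) are made up for this example.

```python
import math
import torch


def soft_pitch(frame: torch.Tensor, sample_rate: int,
               f_min: float = 60.0, f_max: float = 400.0,
               temperature: float = 0.01) -> torch.Tensor:
    """Differentiable pitch estimate in Hz for a single 1-D frame.

    Hypothetical helper for illustration: autocorrelation plus a
    soft-argmax over lags. This is NOT Kaldi's pitch extraction.
    """
    n = frame.shape[-1]
    # Autocorrelation from the power spectrum; zero-padding to 2n
    # avoids circular wrap-around.
    spec = torch.fft.rfft(frame, n=2 * n)
    power = spec.real ** 2 + spec.imag ** 2
    acf = torch.fft.irfft(torch.complex(power, torch.zeros_like(power)), n=2 * n)[:n]
    acf = acf / (acf[0] + 1e-8)  # normalise so that lag 0 is about 1

    # Candidate lags corresponding to the [f_min, f_max] range.
    lag_min = int(sample_rate / f_max)
    lag_max = int(sample_rate / f_min)
    lags = torch.arange(lag_min, lag_max + 1, dtype=frame.dtype)

    # Soft-argmax: a hard argmax has no useful gradient, so take the
    # softmax-weighted average of the lags instead.
    weights = torch.softmax(acf[lag_min:lag_max + 1] / temperature, dim=-1)
    expected_lag = (weights * lags).sum()
    return sample_rate / expected_lag


# Usage: a 200 Hz sine sampled at 16 kHz.
sample_rate = 16000
t = torch.arange(1024, dtype=torch.float32) / sample_rate
frame = torch.sin(2 * math.pi * 200.0 * t).requires_grad_(True)

f0 = soft_pitch(frame, sample_rate)
f0.backward()  # gradients flow back to the input samples
print(f0.item(), frame.grad.shape)
```

Every step is an ordinary tensor operation, so `backward()` fills `frame.grad` without any custom gradient code. The soft-argmax is the design choice that matters: picking the best lag with a hard `argmax` would cut the graph at that point.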

The original Kaldi library is written in C++ and is usually driven through shell (.sh) scripts. I do not know of a PyTorch function that implements it, although there are Python wrappers such as PyKaldi.

If there is no PyTorch implementation, I’m afraid you’ll have to do one of the following: