MFCC extracted by librosa vs. PyTorch

talmaashani2016 · September 15, 2022, 6:38pm

Thank you very much for the quick response. I am extracting MFCC features and passing them to a CNN. One last question: when I tried to double-check by extracting the MFCCs with TorchAudio, I did not get the same output image.

The librosa MFCC image is as follows:
test1

I am trying to set the same default parameters in PyTorch, but with no luck.

Librosa Code:

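import librosa
import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load(file_path)  # resamples to 22050 Hz mono by default
S = librosa.feature.mfcc(y=y, sr=sr)  # default n_mfcc=20
img = librosa.display.specshow(S, sr=sr)
plt.axis('off')
plt.savefig('image.png', bbox_inches='tight', pad_inches=0)
plt.show()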
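
For reference when comparing outputs, the shape of the array returned by the `librosa.feature.mfcc` call above can be estimated without running it. This is a minimal sketch that assumes librosa's documented defaults (`n_mfcc=20`, `hop_length=512`, centered framing); `expected_mfcc_shape` is a hypothetical helper written for illustration, not part of librosa.

```python
def expected_mfcc_shape(n_samples, n_mfcc=20, hop_length=512):
    """Estimate the (n_mfcc, n_frames) shape of an MFCC matrix.

    Assumes centered framing (center=True), where the signal is padded so
    that the number of frames is 1 + n_samples // hop_length.
    """
    n_frames = 1 + n_samples // hop_length
    return (n_mfcc, n_frames)


# One second of audio at 22050 Hz with the assumed defaults
print(expected_mfcc_shape(22050))  # (20, 44)
```

Under these assumptions the librosa image has 20 coefficient rows, so a transform configured with a different `n_mfcc` produces a matrix of a different height and the two images cannot match row for row.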

I have tried using torchaudio, but it is not giving the same results:

    import torchaudio
    import matplotlib.pyplot as plt

    waveform, sample_rate = torchaudio.load(file_path)
    mfcc_transform = torchaudio.transforms.MFCC(
        sample_rate=22050,
        n_mfcc=256,
        melkwargs={
            "n_mels": 256,
            "n_fft": 2048,
            "win_length": None,
            "hop_length": 512,
            "mel_scale": "htk",
        },
    )
    spectrogram_tensor = mfcc_transform(waveform)
    plt.imsave('test.png', spectrogram_tensor[0, :, :].numpy(), vmin=-80, vmax=0, origin="lower", cmap='viridis')
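
One possible source of the mismatch is the mel scale itself: librosa's mel filterbank defaults to the Slaney-style scale (`htk=False`), while the snippet above requests `"htk"`. The sketch below compares the two Hz-to-mel conversions in plain Python. The function names are hypothetical helpers written for illustration; the formulas follow the commonly documented HTK and Slaney definitions and are not copied from either library.

```python
import math


def hz_to_mel_htk(freq_hz):
    """HTK-style mel scale: logarithmic over the whole frequency range."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def hz_to_mel_slaney(freq_hz):
    """Slaney-style mel scale: linear below 1 kHz, logarithmic above."""
    f_sp = 200.0 / 3.0                 # Hz per mel in the linear region
    min_log_hz = 1000.0                # start of the logarithmic region
    min_log_mel = min_log_hz / f_sp    # mel value at 1 kHz (15)
    logstep = math.log(6.4) / 27.0     # step size in the log region
    if freq_hz < min_log_hz:
        return freq_hz / f_sp
    return min_log_mel + math.log(freq_hz / min_log_hz) / logstep


# Compare where a few frequencies fall, as a fraction of the 0 to Nyquist
# range (Nyquist = 11025 Hz for a 22050 Hz sample rate).
nyquist = 11025.0
for f in (250.0, 1000.0, 4000.0):
    htk_pos = hz_to_mel_htk(f) / hz_to_mel_htk(nyquist)
    slaney_pos = hz_to_mel_slaney(f) / hz_to_mel_slaney(nyquist)
    print(f"{f:7.0f} Hz  htk={htk_pos:.3f}  slaney={slaney_pos:.3f}")
```

Because the two scales place the same frequency at different relative positions, the triangular mel filters end up centered at different frequencies, and the resulting MFCCs differ even when `n_fft`, `hop_length`, and `n_mels` agree.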

Thank you