Hello,
I would like to generate MFCC images like the ones librosa produces, but using TorchAudio.
Librosa code:
import librosa
import librosa.display
import matplotlib.pyplot as plt

y, sr = librosa.load(file_path)
S = librosa.feature.mfcc(y=y, sr=sr)
img = librosa.display.specshow(S, sr=sr)
plt.axis('off')
plt.savefig('image.png', bbox_inches='tight', pad_inches=0)
plt.show()
I have tried using torchaudio, but it is not giving the same results:
import torchaudio
import matplotlib.pyplot as plt

waveform, sample_rate = torchaudio.load(file_path)
spectrogram_tensor = torchaudio.transforms.MFCC(
    sample_rate=22050,
    n_mfcc=256,
    melkwargs={
        "n_mels": 256,
        "n_fft": 2048,
        "win_length": None,
        "hop_length": 512,
        "mel_scale": "htk",
    },
)(waveform)
plt.imsave('test.png', spectrogram_tensor[0, :, :].numpy(), vmin=-80, vmax=0, origin="lower", cmap='viridis')