Is n_frames equivalent to librosa’s duration?
pytorch_version: 1.9.0+cu111
torchaudio_version: 0.9.0
I’m currently working in audio classification task where I found that if the audio file is provided entirely in load_audio() function, it provides the stereo output of the sampled version with default sampling_rate
But, if i provide offset and n_frames into the function, it doesn’t resample it.
Actually I have a doubt that is n_frames is equivalent to librosa’s duration.
In my case the offset is 30 and duration is 10 seconds
reading the audio file with offset and n_frames