I have different audio files in .wav format. In each audio file there is a female/male voice saying one word in Kazakh language. I would like to increase the samples, so that from 1 audio file I can generate 3-4 samples with clear pronunciation of the word but with different voice. I have tried different augmentations, but it didn’t help that much.
Is there any other options?