Hello experts in DL and Pytorch,
I have multiple mp3 files with a voice of mine (and corresponding txt files)
How is it possible to train a Pytorch model, so it will make a speech-to-text generation of any text with my voice? Please give me hints/tips/ideas.
Thank you in advance.