I don’t know if I need NLP for a problem I have with a large audio data set.
Let me explain why.
I only need to find all the relationships between 20 hours of audio data and a specific outcome that the audio influences.
The words in English, Spanish, and French DO matter, and they DO influence the results.
- My cost to run this will keep getting smaller, as I will be building my own data lab over the next 5 years.
- Accuracy is my top concern, and I want to learn as much about the design and ideas as possible.
Would running a “find all relationships” method asynchronously on a massive data center be the right answer, instead of first converting the audio to words?
I am a novice at this. I hope to be working with you all on here a lot for a while.
I want to make sure I start the design the right way, so that I can expand the training module with more and more data. But I want ALL relationships, even ones I can’t understand, like the tone and inflection in the voice.
(Maybe I could cut it to mono and a low bit rate so it runs quicker?)
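
A minimal sketch of what cutting to mono and a lower rate could look like, assuming "low bit rate" means fewer samples per second and that the audio is already loaded as plain lists of floats. The function names `to_mono` and `downsample` are made up for illustration, and block averaging is only a crude stand-in for a proper resampler.

```python
def to_mono(left, right):
    """Average two equal-length channels into one mono channel."""
    if len(left) != len(right):
        raise ValueError("channels must have the same length")
    return [(l + r) / 2.0 for l, r in zip(left, right)]


def downsample(samples, factor):
    """Reduce the sample rate by an integer factor by averaging each block.

    Averaging each block acts as a crude low-pass filter. Trailing samples
    that do not fill a whole block are dropped.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    usable = len(samples) - len(samples) % factor
    return [
        sum(samples[i:i + factor]) / factor
        for i in range(0, usable, factor)
    ]


# Toy stereo signal (hypothetical values, not real audio).
left = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]
right = [1.0, 0.8, 0.6, 0.4, 0.2, 0.0]
mono = to_mono(left, right)

# Halve the sample rate of a short toy signal.
small = downsample([1.0, 3.0, 5.0, 7.0, 9.0], 2)
```

One caveat with this idea: lowering the sample rate throws away the higher frequencies, so some of the tone and inflection detail may be lost along with the data size.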
Basically, would I not need to train the model to “understand” the language at all, and only show it that this data produced this result, so that it can find the maximum number of relationships possible?
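
A rough sketch of the "this data made this result" idea, under the assumption that the audio is a plain list of float samples and that no transcription step is involved. The helper `frame_features`, the two features chosen (loudness and zero crossings), and the label `"outcome_A"` are all hypothetical placeholders, not a recommended design.

```python
import math


def frame_features(samples, frame_size):
    """Split samples into non-overlapping frames and describe each frame.

    Returns one (rms_energy, zero_crossings) pair per full frame.
    No words are involved: the features come straight from the waveform.
    """
    features = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        # Root-mean-square energy: a rough measure of loudness.
        rms = math.sqrt(sum(x * x for x in frame) / frame_size)
        # Count sign changes between neighbours: a rough measure of pitch/noisiness.
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0)
        )
        features.append((rms, crossings))
    return features


# Toy clip (hypothetical values) paired with a hypothetical outcome label.
clip = [1.0, -1.0, 1.0, -1.0, 0.5, 0.5, 0.5, 0.5]
features = frame_features(clip, 4)
training_example = (features, "outcome_A")
```

A model trained on pairs like `training_example` would only ever see the sound and the result, never the words.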
Looking forward to chatting a lot!
Thanks all in advance.