I have some audio data for which I computed audio features. Each input feature vector has 20 binary labels, i.e. for each input element there are 20 classes, each either 1 or 0.
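To make the label layout concrete, here is a minimal sketch of a single example in plain Python. Everything in it besides the count of 20 is an assumption for illustration: the active classes, the logits (standing in for the raw outputs of some hypothetical model), the 0.5 threshold, and the helper names `sigmoid`, `multilabel_bce`, and `predict`. It scores each of the 20 classes as its own yes/no decision with a per-class sigmoid and binary cross-entropy, which is one common way to set this up, not necessarily the right one for my data.

```python
import math

NUM_CLASSES = 20  # from the problem: 20 binary labels per input


def sigmoid(z):
    """Map a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def multilabel_bce(logits, targets):
    """Mean binary cross-entropy, treating every label independently."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for z, y in zip(logits, targets):
        p = sigmoid(z)
        total += -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))
    return total / len(targets)


def predict(logits, threshold=0.5):
    """Turn each per-class probability into a 0/1 decision."""
    return [1 if sigmoid(z) >= threshold else 0 for z in logits]


# Hypothetical target for one input: classes 0, 3 and 7 are active.
target = [0] * NUM_CLASSES
for k in (0, 3, 7):
    target[k] = 1

# Hypothetical model outputs: strongly positive for active classes,
# strongly negative for the rest.
logits = [4.0 if y == 1 else -4.0 for y in target]

pred = predict(logits)
loss = multilabel_bce(logits, target)
print(pred)
print(round(loss, 4))
```

Unlike a softmax over 20 classes, nothing here forces the per-class probabilities to sum to 1, so any number of classes can be active for the same input.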
My initial understanding is that this is a multi-label classification problem that can be addressed using
Can you let me know if I got this right and explain how