Audio Classification
State of the Art Audio Classification – Simply
by Erik Tomica, Design Enterprise Studio Member, March 2021 Have you ever wondered how can humans recognise music and sounds in just a couple of seconds? And how applications like Shazam replicate that despite of such a vast amount of music released every year? How are humans able to tell someone’s emotion based on the tone of their voice and would a machine be able to do the same without any understanding of what emotion is? This post aims to answer some of these questions in a simple way. There won’t be any technical terminology and if so, this will be very simply explained so let’s dive right in:A lot of progress in machine learning and IT in general comes from understanding humans, how we do things and why. In the case of audio recognition and audio classification we [humans] have have taken inspiration from mother nature once again. It all started with a simple question:...