GIZ NLP Agricultural Keyword Spotter
$7,000 USD
Classify audio utterances in Luganda and English from Uganda
688 data scientists enrolled, 246 on the leaderboard
Agriculture, Classification, Audio, NLP, Unstructured, SDG2
Uganda
11 September—29 November 23:59
Beginners Approach to Audio Classification
published 12 Nov 2020, 17:35

Hello fellow aspiring data scientists. As a first-timer in audio classification, I'd like to find tutorial content on the task. I'm having difficulty understanding how to make models learn accurately from audio converted to images by applying different image transforms. Any resource that helps in understanding the topic would be much appreciated. Personal explanations would also help a lot. Thanks.

I have been having the same problem. For audio, I think you cannot use many transformations other than resizing the spectrogram, unless I have been researching the wrong material. If you find help, please share.

I have found the following blog post very helpful: https://www.assemblyai.com/blog/end-to-end-speech-recognition-pytorch . As far as I can tell, data augmentation is done on the raw signal before it is converted into a spectrogram: https://medium.com/@makcedward/data-augmentation-for-audio-76912b01fdf6
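
To make the "augment the raw signal first, then build the spectrogram" idea concrete, here is a minimal sketch using only NumPy. It is not taken from either linked post: the function names (add_noise, time_shift, log_spectrogram), the parameter values, and the synthetic 440 Hz tone standing in for a real utterance are all assumptions for illustration. In practice you would load the competition audio with a library of your choice and probably use a mel spectrogram.

```python
import numpy as np

def add_noise(signal, noise_level=0.005, rng=None):
    """Add Gaussian noise to the raw waveform (noise_level is an assumed value)."""
    rng = rng or np.random.default_rng()
    return signal + noise_level * rng.standard_normal(len(signal))

def time_shift(signal, max_shift, rng=None):
    """Shift the waveform by a random number of samples.
    np.roll wraps samples around the ends, which is a simplification."""
    rng = rng or np.random.default_rng()
    shift = int(rng.integers(-max_shift, max_shift + 1))
    return np.roll(signal, shift)

def log_spectrogram(signal, n_fft=256, hop=128):
    """Log power spectrogram from a Hann-windowed short-time Fourier transform.
    Returns an array of shape (n_fft // 2 + 1, n_frames)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([
        signal[i * hop : i * hop + n_fft] * window
        for i in range(n_frames)
    ])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    return np.log(power + 1e-10).T

# Synthetic one-second "utterance" at 16 kHz (placeholder for real audio).
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
clean = 0.5 * np.sin(2 * np.pi * 440 * t)

# Augment the waveform first, then convert to the image-like representation.
rng = np.random.default_rng(0)
augmented = time_shift(add_noise(clean, rng=rng), max_shift=1600, rng=rng)
spec = log_spectrogram(augmented)
print(spec.shape)
```

Each training epoch can draw a fresh random augmentation of the same clip, so the model sees slightly different spectrograms of the same keyword. This is one reason augmenting the waveform is often preferred over generic image transforms such as flips, which change what the time and frequency axes mean.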