Primary competition visual

GIZ NLP Agricultural Keyword Spotter

Helping Uganda
$7 000 USD
Challenge completed almost 5 years ago
Classification
Automatic Speech Recognition
Natural Language Processing
737 joined
253 active
Starti
Sep 11, 20
Closei
Nov 29, 20
Reveali
Nov 29, 20
User avatar
Lone_Wolf
University of ghana
Beginners Approach to Audio Classification
Notebooks · 12 Nov 2020, 17:35 · 2

Hello fellow Aspiring Data Scientists, As one of the first timers to Audio Classification.. I'd like to get access to tutorial content on the task . Because from my point of view i'm finding difficulty in understanding how to make the models learn accurately from audios converted to images by applying different image transforms. Any form of resource helpful in understanding the topic would be much appreciated... Also Personal explanations would help a bunch.. Thanks

Discussion 2 answers

I have been having the same problem too. For audio i think you can not use much transformations except resize the spectogram. Unless i have been researching wrong stuff. If you find help do share.

12 Nov 2020, 17:55
Upvotes 0
User avatar
pmwaniki
Kemri wellcome trust research programme

I have found the following blog post very helpful: https://www.assemblyai.com/blog/end-to-end-speech-recognition-pytorch . As far as I can tell, data augmentation is done on the raw signal before converting into a spectrogram: https://medium.com/@makcedward/data-augmentation-for-audio-76912b01fdf6