GIZ NLP Agricultural Keyword Spotter

Helping Uganda

$7 000 USD

Completed (over 5 years ago)

Start

Sep 11, 20

Nov 29, 20

Reveal

Nov 29, 20

Brainiac

Text extraction from audio files

Help · 23 Nov 2020, 15:43 · edited 7 minutes later · 4

@zindi

Is it okay to extract words from audio files and then map them to form pseudo labels?

For example, if the extracted word is "so", it is mapped to "sow" and used as a pseudo label in training.
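To make the question concrete, here is a minimal sketch of the mapping step, assuming the words have already been extracted by some speech-to-text tool. The keyword list, the homophone table, and the `to_pseudo_label` helper are all hypothetical and only illustrate the idea; the real competition labels are not listed in this thread.

```python
from difflib import get_close_matches

# Hypothetical keyword vocabulary (assumption, not the competition's label set).
KEYWORDS = ["sow", "seed", "maize", "harvest"]

# Hypothetical hand-written fixes for words a transcriber commonly confuses.
HOMOPHONES = {"so": "sow", "sew": "sow"}


def to_pseudo_label(word, keywords=KEYWORDS, cutoff=0.8):
    """Map one transcribed word to a keyword label, or None if nothing fits."""
    w = word.strip().lower()
    if w in keywords:
        return w                      # exact match with a known keyword
    if w in HOMOPHONES:
        return HOMOPHONES[w]          # e.g. "so" -> "sow"
    # Fall back to fuzzy string matching for small spelling differences.
    matches = get_close_matches(w, keywords, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    for extracted in ["so", "Maize ", "harvst", "banana"]:
        print(repr(extracted), "->", to_pseudo_label(extracted))
```

Clips whose extracted word maps to `None` would simply be left without a pseudo label rather than forced into a class.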

Discussion 4 answers

ZINDI

Yes you may.

With image classification, models pre-trained on ImageNet are something of a standard, and are often built into popular libraries. For audio there isn't an exact equivalent.

Obviously, we don't want a situation where someone wins because of access to something the other participants didn't have. So in general, sourcing an extra dataset (even a public one) and using it to get an edge would be a potential issue. But if you have a dataset (or, even better, a pretrained model) in mind that you think would help all entrants, and it's public and free, let us know and we can see about adding it as an allowed source.

24 Nov 2020, 08:36