🚜 Join the Buzz: 6th place solution

GIZ NLP Agricultural Keyword Spotter

Helping Uganda

$7 000 USD

Completed (over 5 years ago)

Start

Sep 11, 20

Nov 29, 20

Reveal

Nov 29, 20

letfoolsdie

6th place solution

Notebooks · 1 Dec 2020, 00:28 · 4

I've shared my solution, which placed 6th on the private leaderboard. Hopefully, that will not change after the code review :)

https://github.com/letfoolsdie/zindi-agricultural

In summary, the final solution is a geometric mean of several ImageNet-pretrained models, trained with different parameters on spectrograms/mel spectrograms. Predictions are averaged first across folds and then across models.
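For readers who want to see the blending order spelled out, here is a minimal sketch for a single audio clip. It is not taken from the repository: the function names are hypothetical, the fold-level average is assumed to be arithmetic (the post does not say which mean is used there), and the renormalisation after the geometric mean is also an assumption.

```python
import math

def arithmetic_mean(preds):
    """Element-wise mean of equally sized class-probability vectors."""
    n = len(preds)
    return [sum(col) / n for col in zip(*preds)]

def geometric_mean(preds, eps=1e-12):
    """Element-wise geometric mean, renormalised so the result sums to 1."""
    n = len(preds)
    # eps guards against log(0); its value is an arbitrary choice for this sketch
    gm = [math.exp(sum(math.log(max(p, eps)) for p in col) / n)
          for col in zip(*preds)]
    total = sum(gm)
    return [g / total for g in gm]

def ensemble(fold_preds_per_model):
    """fold_preds_per_model[m][f] is model m's probability vector from fold f."""
    # Step 1: average each model over its folds (assumed arithmetic)
    per_model = [arithmetic_mean(folds) for folds in fold_preds_per_model]
    # Step 2: blend the fold-averaged models with a geometric mean
    return geometric_mean(per_model)

# Toy numbers for one clip and three classes (made up for illustration)
model_a_folds = [[0.7, 0.2, 0.1], [0.5, 0.3, 0.2]]
model_b_folds = [[0.4, 0.4, 0.2], [0.6, 0.2, 0.2]]
final = ensemble([model_a_folds, model_b_folds])
```

Compared with a plain average, a geometric mean pulls a class's score down more strongly when any one model assigns it a low probability.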

There's also a postprocessing step, where I try to find junk test files (containing just noise) using a pretrained PANN and replace the models' predictions for them with a constant prediction based on the frequency of each class. It reduced the loss a bit (by ~0.01-0.015 points).
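The replacement step could look roughly like the sketch below. It is an illustration under assumptions, not the repository's code: the junk flags are taken as given (in the post they come from a pretrained PANN, which is not reproduced here), the class frequencies are assumed to be computed from training labels, and the function names and class labels are hypothetical.

```python
from collections import Counter

def class_prior(train_labels, classes):
    """Relative frequency of each class, in the order given by `classes`."""
    counts = Counter(train_labels)
    total = len(train_labels)
    return [counts[c] / total for c in classes]

def replace_junk_predictions(preds, is_junk, prior):
    """Swap in the constant prior wherever a file was flagged as noise-only."""
    return [list(prior) if junk else row
            for row, junk in zip(preds, is_junk)]

# Toy data: two hypothetical classes, three test files, the second flagged as junk
classes = ["word_a", "word_b"]
prior = class_prior(["word_a", "word_a", "word_b", "word_a"], classes)
preds = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]]
is_junk = [False, True, False]  # assumed output of a separate noise detector
cleaned = replace_junk_predictions(preds, is_junk, prior)
```

The idea is that a file with no speech carries no information about the keyword, so a confident model prediction on it is more likely to be penalised than the class-frequency baseline.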

Discussion 4 answers

aninda_bitm

Thanks a lot. Did you also experiment with self-supervised pretraining on spectrograms (like using spectrograms to predict duration) and then using those weights for classification? I tried that approach but could only manage 1.71 on the private leaderboard.

1 Dec 2020, 04:30

letfoolsdie

I hadn't thought of that actually, and it seems like a good idea to me :) I wish I'd tried that; I guess it should improve predictions at least a little. Except I would try adding audio duration as a separate input to the model instead of training a model to predict duration.

replied to aninda_bitm · 1 Dec 2020, 08:03

LB_cruise

Federal University of Technology Akure

Thanks

1 Dec 2020, 12:31

anamip

Thanks so much for sharing and congratulations!

1 Dec 2020, 16:04
