⚠️ Join the Buzz: 1st place solution

Urban Air Pollution Challenge by #ZindiWeekendz

Helping Africa

$300 USD

Completed (almost 6 years ago)

236 joined

134 active

Start

Apr 10, 20

Apr 12, 20

Reveal

Apr 12, 20

devnikhilmishra

1st place solution

Notebooks · 14 Apr 2020, 03:56 · 14

First of all, I am extremely sorry for uploading my solution so late. I was busy with some work, and a lot of cleaning was required.

Secondly, I decided not to post my link on Akeelah's thread because it was crowded with comments, and I thought some people would have trouble finding my link there. Kudos to Akeelah and everyone who has shared their solutions.

Congratulations to everyone who participated and learnt new things in this competition; it's a win for everyone.

And lastly, I am thankful to Zindi for organizing such a wonderful competition. I hope there are many more ZindiWeekendz to come.

Here is my solution:

https://github.com/nikhilmishradevelop/zindi-winning-solutions

Discussion 14 answers

chetan-ambi

Congratulations, Nikhil! Thanks for sharing your solution.

14 Apr 2020, 04:07

Upvotes 0

The_other_guy

Congratulations, and thanks for the solution.

14 Apr 2020, 04:28

Upvotes 0

msamwelmollel

University of Glasgow

Hi Nikhil,

Congratulations on winning. Can I ask a favour of you and the other top two solutions? If you don't mind, could you comment your lines of code, or just state the objective of some code blocks? This would help most of us follow your code closely and easily.

I appreciate it, and thank you for sharing with us.

14 Apr 2020, 05:09

Upvotes 0

Ogayo

African leadership university

I agree. If you can even go ahead and explain your thought process through a video, that will be much appreciated.

replied to msamwelmollel · 14 Apr 2020, 06:57

Upvotes 0

devnikhilmishra

msamwelmollel and Ogayo, sure, I will try to make my notebook more readable and add more comments. Thank you.

replied to msamwelmollel · 14 Apr 2020, 07:10

Upvotes 0

eaedk

Dakar institute of technology

I have some questions, please. Did you use a grid search first?

And can you explain your feature engineering?

14 Apr 2020, 16:49 (edited ~8 hours later)

Upvotes 0

devnikhilmishra

I added some comments and my thought process about feature engineering to the repo. Please check it out. I did not use any grid search; I tuned the hyperparameters manually.

replied to eaedk · 15 Apr 2020, 02:41

Upvotes 0

eaedk

Dakar institute of technology

ok thanks

replied to devnikhilmishra · 15 Apr 2020, 09:25

Upvotes 0

Paul_Okewunmi

Obafemi awolowo university ile-ife

Big thanks to you, Mishra. Now I think I have a better understanding of your solution. If I may ask, how long did it take to train on a Kaggle kernel, considering that you had over 3400 features?

replied to devnikhilmishra · 15 Apr 2020, 10:14

Upvotes 0

devnikhilmishra

Hi, it took 2-3 hours to run on Kaggle for 10 folds.

replied to Paul_Okewunmi · 15 Apr 2020, 15:29 (edited less than a minute later)

Upvotes 0

eaedk

Dakar institute of technology

Please, why do you use train data in valid_sets with simple test data?

15 Apr 2020, 18:50

Upvotes 0

devnikhilmishra

I did not understand your question.

replied to eaedk · 16 Apr 2020, 06:42

Upvotes 0

kolatimiDave

University of lagos

Hello @devnikhilmishra, in your code I noticed you did:

    for i in range(1, 20):
        df[f'prev_target_{i}'] = df.sort_values(by='Date')[TARGET_COL].fillna(method='ffill').shift(i).sort_index()
        df[f'next_target_{i}'] = df.sort_values(by='Date')[TARGET_COL].fillna(method='bfill').shift(-i).sort_index()

So this gets the previous and next targets, but the test set does not have a target column, so how did you use those features when making predictions? Please explain what you did here. Thanks.

And this also:

    for i in tqdm_notebook(range(1, 15)):
        df[f'magic_{i}'] = df.sort_values(by='Date')[TARGET_COL].shift(i).expanding().mean().fillna(method='ffill').sort_index()
        df[f'magic2_{i}'] = df.sort_values(by='Date')[TARGET_COL].shift(-i).expanding().mean().fillna(method='bfill').sort_index()

Please, I'd like your explanation.

17 Apr 2020, 10:08

Upvotes 0
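The thread does not include an answer to this question, so what follows is only one plausible reading, not the author's confirmed explanation. It assumes `df` is the train and test sets concatenated, with the test rows' target left as NaN. Under that assumption, a forward fill in date order carries the last known train target into the test rows, and the expanding mean simply skips the missing values, so both features are defined for test rows too. The sketch below is dependency-free Python that mimics the pandas calls `fillna(method='ffill')`, `shift(i)` and `expanding().mean()` on a tiny made-up series; the helper names and the data are hypothetical.

```python
from typing import List, Optional

# None stands in for NaN, i.e. an unknown test target (assumption: test rows
# sit in the same frame as train rows, with the target missing).
Series = List[Optional[float]]


def ffill(values: Series) -> Series:
    """Carry the last known value forward, like pandas fillna(method='ffill')."""
    out: Series = []
    last: Optional[float] = None
    for v in values:
        if v is not None:
            last = v
        out.append(last)
    return out


def shift(values: Series, periods: int) -> Series:
    """Move values down by `periods` rows (periods >= 0), like pandas shift(periods)."""
    return ([None] * periods + list(values))[: len(values)]


def expanding_mean(values: Series) -> Series:
    """Running mean of the known values seen so far.

    Missing entries are skipped, as pandas expanding().mean() skips NaN.
    """
    out: Series = []
    total, count = 0.0, 0
    for v in values:
        if v is not None:
            total += v
            count += 1
        out.append(total / count if count else None)
    return out


# Hypothetical data, already sorted by Date: three train rows, then two test rows.
target: Series = [10.0, 20.0, 30.0, None, None]

# Analogue of prev_target_1: forward fill, then shift by one row.
prev_target_1 = shift(ffill(target), 1)

# Analogue of magic_1: shift by one row, then take the expanding mean.
magic_1 = expanding_mean(shift(target, 1))

print(prev_target_1)  # [None, 10.0, 20.0, 30.0, 30.0]
print(magic_1)        # [None, 10.0, 15.0, 20.0, 20.0]
```

In this toy series the two test rows (the last two positions) receive values built only from train targets. The `next_target_*` and `magic2_*` features would be the mirror image, using a backward fill and a negative shift.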
