💰 Join the Buzz: First place approach...

DSN AI Bootcamp Qualification Hackathon by Data Science Nigeria

Helping Nigeria

Knowledge

Challenge completed ~5 years ago

Skills you will learn

Prediction

1094 joined

749 active

Start

Sep 09, 20

Oct 03, 20

Reveal

Oct 03, 20

Aifenaike

University of Ibadan

First place approach...

Notebooks · 4 Oct 2020, 10:59 · edited 26 minutes later · 24

This is the link to my solution:

https://github.com/aifenaike/DSN_KOWOPE

It's just a simple approach with no feature engineering, basically just stacking, as this was a learning experience for me. I hope it helps.

Discussion 24 answers

Gozie

Freelance

Wow. Nice concept bro. Thanks for sharing.

4 Oct 2020, 11:19

Upvotes 0

Maayowa

Thanks for sharing your notebook.

4 Oct 2020, 11:22

Upvotes 0

wanderercn

Congratulations, and thanks for sharing your solution!

4 Oct 2020, 11:22

Upvotes 0

Engineer

Great, such a simple approach. Thumbs up, man. Congratulations!

4 Oct 2020, 11:35

Upvotes 0

ajulojays

University of Ibadan

Thanks

4 Oct 2020, 11:41

Upvotes 0

tiva

University of Jos

This is a good one, bro. I really admire its simplicity, given that I was expecting something complex 👏🏿👏🏿👏🏿👏🏿

4 Oct 2020, 11:54

Upvotes 0

Enigma

Obafemi Awolowo University, Ile-Ife

Thanks a lot for sharing 👍🏽, Congrats on your win. 🏆

4 Oct 2020, 12:00

Upvotes 0

sharonibejih

Thank you so much for this.

4 Oct 2020, 12:07

Upvotes 0

Adebaicy

Why use linear regression as the meta-estimator when this is a classification task? Please answer this one question; I understand the rest.

4 Oct 2020, 12:41

Upvotes 0

Maayowa

In the case of the meta-model prediction, he was not predicting a class but float values based on the class probabilities generated by the sub-models.

replied to Adebaicy · 4 Oct 2020, 13:27

Upvotes 0

Adebaicy

I used a classifier as my final estimator... wrong button pressed... oops....

replied to Maayowa · 4 Oct 2020, 13:47

Upvotes 0

Aifenaike

University of Ibadan

Alright @Adebaicy

In my case, what I wanted was to see the correlation between the predictions from the different classifiers, and also to obtain a function that takes the predicted values from the sub-models and computes a float value (not another probability). What better candidate than a regressor? Beyond that, I chose linear regression for two reasons: simplicity and interpretable results.

Simplicity: I tried using classifiers such as ExtraTrees and XGBoost, and they failed. Using a classifier is also more complex, because you still have to call predict_proba and then select the probability of defaulting. Predicting a probability from probabilities, lol.

Interpretable: unlike classifiers, which use feature importance (the features responsible for splitting at nodes), I wanted to see which models were contributing to the magnitude of the values the stack predicted. What better way than the coef_ from linear regression?
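
For readers who want to see the idea in code, here is a minimal sketch, not the author's actual notebook. It assumes scikit-learn, a synthetic binary dataset standing in for the competition data (1 = default), and three hypothetical sub-models chosen only for illustration. Each sub-model contributes its out-of-fold probability of class 1, a `LinearRegression` is fitted on those columns, and `coef_` shows how much each sub-model contributes to the stacked float score.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import ExtraTreesClassifier, RandomForestClassifier
from sklearn.linear_model import LinearRegression, LogisticRegression
from sklearn.model_selection import cross_val_predict, train_test_split

# Synthetic stand-in for the competition data (assumption: binary target, 1 = default).
X, y = make_classification(n_samples=600, n_features=12, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Hypothetical sub-models; the original solution may use different ones.
base_models = {
    "extra_trees": ExtraTreesClassifier(n_estimators=100, random_state=0),
    "random_forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "log_reg": LogisticRegression(max_iter=1000),
}

# Out-of-fold P(class 1) for every training row, one column per sub-model,
# so the meta-model is trained on predictions the sub-models made for unseen rows.
train_meta = np.column_stack([
    cross_val_predict(model, X_train, y_train, cv=5, method="predict_proba")[:, 1]
    for model in base_models.values()
])

# Refit each sub-model on all training rows and score the held-out rows.
test_meta = np.column_stack([
    model.fit(X_train, y_train).predict_proba(X_test)[:, 1]
    for model in base_models.values()
])

# Linear regression as the meta-estimator: it outputs a float score,
# which is not guaranteed to stay inside [0, 1].
meta = LinearRegression().fit(train_meta, y_train)
stacked_scores = meta.predict(test_meta)

# Correlation between the sub-models' predictions.
corr = np.corrcoef(train_meta, rowvar=False)
print(np.round(corr, 3))

# Contribution of each sub-model to the stacked score.
for name, coef in zip(base_models, meta.coef_):
    print(f"{name}: {coef:.3f}")
```

If the score has to be read as a probability, one option is to clip it with `np.clip(stacked_scores, 0, 1)`; for a ranking-based metric the raw floats can be used directly.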

replied to Adebaicy · 4 Oct 2020, 14:24

Upvotes 0

Aifenaike

University of Ibadan

It depends. There are some works on Kaggle that use classifiers to stack too, but that at times requires additional tuning, and we had only 10 submissions per day, so...

replied to Adebaicy · 4 Oct 2020, 14:26

Upvotes 0

Olawale_Big

Federal University of Technology, Akure

Hi, thanks for sharing your solution, and congratulations! I have a question: in the case of label predictions (multi-class classification), how do we stack? Can stacking be done on probabilities, using linear regression as the meta-learner to predict the actual labels, or should logistic regression be used to predict the actual label? Please shed more light on how to go about it if you can. Thanks.
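
One possible way to set this up, offered as a sketch rather than as the winner's method: scikit-learn's `StackingClassifier` can stack on class probabilities (`stack_method="predict_proba"`) and use logistic regression as the final estimator, which predicts the labels directly. The three-class synthetic dataset and the two sub-models below are assumptions made only for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    ExtraTreesClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic three-class problem (assumption: stands in for any multi-class task).
X, y = make_classification(
    n_samples=600, n_features=12, n_informative=6, n_classes=3, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Hypothetical sub-models, chosen only for the example.
stack = StackingClassifier(
    estimators=[
        ("extra_trees", ExtraTreesClassifier(n_estimators=100, random_state=0)),
        ("random_forest", RandomForestClassifier(n_estimators=100, random_state=0)),
    ],
    final_estimator=LogisticRegression(max_iter=1000),  # classifier as meta-learner
    stack_method="predict_proba",  # feed per-class probabilities to the meta-learner
    cv=5,  # meta-learner is trained on out-of-fold sub-model predictions
)
stack.fit(X_train, y_train)

labels = stack.predict(X_test)              # predicted class labels
proba = stack.predict_proba(X_test)         # one probability column per class
meta_features = stack.transform(X_test)     # sub-model probabilities seen by the meta-learner

print("accuracy:", np.mean(labels == y_test))
```

A linear regression meta-learner outputs a single float, so it fits the binary case more naturally; with several classes, a classifier as the final estimator avoids having to map that float back to a label.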

replied to Aifenaike · 4 Oct 2020, 14:40

Upvotes 0

Rhydian

Nice concept bro!!! I was expecting something complex.

Love the simplicity.

4 Oct 2020, 13:00

Upvotes 0