🔍 Join the Buzz: score of 0 % repeatedly using...

Xente Fraud Detection Challenge

Helping Uganda

$4 500 USD

Completed (almost 7 years ago)

Skills you will learn

Classification

2058 joined

545 active

Info Data Chat Leaderboard

Start

May 20, 19

Sep 22, 19

Reveal

Sep 23, 19

Laureta

score of 0 % repeatedly using different models on submission

Data · 12 Sep 2019, 11:52 · 8

I have gotten a score of 0 % on all the models i have submitted inclusing the xgboost, randomforest, svm, adaboost e.t.c. I would love to get a solution to this problem or anyone who also went through this problem and solved it should please help. Thanks

Discussion 8 answers

Alexwolf

You must be submitting something wrong, for example, predict_proba instead of predict?

12 Sep 2019, 12:06

Upvotes 0

Laureta

No, I used .predict() for all my models

replied to Alexwolf12 Sep 2019, 14:30

Upvotes 0

Raheem_Nasirudeen

Using Ensemble model can make it float data type convert to Integers it will work. 0.0 is not allowed use 0

12 Sep 2019, 14:16

Upvotes 0

Laureta

I typecast the FraudResult column to int using the .astype(int) method several times. And I still got a 0% score

replied to Raheem_Nasirudeen12 Sep 2019, 14:32

Upvotes 0

Fall_Khalil

I got the same problem. Have you got a solution?

16 Sep 2019, 11:33 (edited less than a minute later)

Upvotes 0

femiogundare

Hi there! I also once faced the same problem in this competition.

F1 score is the harmonic mean of precision and recall. And so, there is a tradeoff - as precision increases, recall decreases and vice-versa. The score of 0 probably means that your model(s) predicted only one class for all samples in the test data.

The solution would be to add a cost of penalization to the class of lower frequency, which is fraud(1). Models such as XGBoost, LightGBM, Random Forest and Support Vector Machines helps in such penalization.

Also, trying resampling techniques like SMOTE, NearMiss, Random Sampling, etc may also help.

Goodluck!

16 Sep 2019, 21:11

Upvotes 0

johnbagiliko

Hi there! I have been having a similar problem. Have you solved it yet?

20 Sep 2019, 15:35

Upvotes 0

Engineer

be sure you are not splitting the data, rather encode them to avoid 0 score

replied to johnbagiliko21 Sep 2019, 03:12

Upvotes 0

Join the largest network for
data scientists and AI builders

About FAQs

Status