
Expresso Churn Prediction Challenge

Helping Senegal

$1 000 USD

Completed (over 4 years ago)

Skills you will learn

Classification

Prediction

1402 joined

437 active


Start

Aug 27, 21

Nov 28, 21

Reveal

Nov 28, 21

MahadAhmed700

National university of science and technology

Accuracy significantly different in X_test & test

Help · 5 Nov 2021, 08:42 · 4

Hey there, I am seeing a significant difference in accuracy between X_test (the 30% of train.csv that I hold out to estimate accuracy) and test.csv. When I evaluate the model on X_test, it scores 0.86 overall, but when I use the same model to predict churn for test.csv and upload the solution, it scores 0.5. Is this normal, or does it indicate an error? I have cross-checked multiple times.

Discussion 4 answers

kiminya

Strathmore university

Hello,

The LB evaluation metric is AUC. Could you check the AUC of your local validation set? Note that it's possible to have a very high accuracy but low (~0.5) AUC.

Also make sure your predictions are probabilities between 0 and 1, not hard 0/1 labels.
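To illustrate how accuracy can be high while AUC sits at 0.5, here is a minimal sketch in plain Python. The 90/10 label split and the helper names `accuracy` and `auc` are made up for illustration; this is not the competition data or the platform's scoring code. The `auc` helper uses the ranking definition of AUC: the chance that a randomly chosen positive gets a higher score than a randomly chosen negative, with ties counted as half.

```python
def accuracy(y_true, y_pred):
    """Fraction of rows where the predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def auc(y_true, scores):
    """Pairwise AUC: share of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical imbalanced target: 90 non-churners, 10 churners.
y_true = [0] * 90 + [1] * 10

# A "model" that always predicts the majority class.
always_zero = [0] * 100

acc = accuracy(y_true, always_zero)   # 90 of 100 rows are correct
auc_const = auc(y_true, always_zero)  # every pair is a tie

print(f"accuracy = {acc}, AUC = {auc_const}")
```

Because every prediction is identical, no churner is ranked above any non-churner, so the AUC is no better than random guessing even though most rows are labelled correctly.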

5 Nov 2021, 09:09

Upvotes 0

MahadAhmed700

National university of science and technology

Thank you so much. I was not using area under the curve (AUC) as the evaluation metric, and I was also submitting churn as hard 0 and 1 labels. Thanks a lot.

replied to kiminya · 5 Nov 2021, 10:45 (edited ~3 hours later)

Upvotes 0

tobi_ace

Hi, ensure you are predicting probabilities instead of absolute values. For most sklearn classifiers, switching 'predict(test)' to 'predict_proba(test)[:,1]' should do the trick.
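A minimal sketch of that switch, assuming scikit-learn is installed. The synthetic data from `make_classification`, the choice of `LogisticRegression`, and the variable names are all placeholders for illustration, not the Expresso dataset or any particular competition solution.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for train.csv: an imbalanced binary target.
X, y = make_classification(
    n_samples=2000, n_features=10, weights=[0.8, 0.2], random_state=0
)

# Hold out 30% for local validation, as in the original question.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# predict() returns hard class labels (0 or 1).
hard_labels = model.predict(X_val)

# predict_proba() returns one column per class; column 1 is P(class == 1).
churn_proba = model.predict_proba(X_val)[:, 1]

# Score both against the same validation labels with the AUC metric.
auc_labels = roc_auc_score(y_val, hard_labels)
auc_proba = roc_auc_score(y_val, churn_proba)

print(f"AUC from hard labels:   {auc_labels:.3f}")
print(f"AUC from probabilities: {auc_proba:.3f}")
```

Hard labels collapse every row to one of two scores, so most of the ranking information that AUC rewards is lost; the probability column keeps it. For a submission, the same `predict_proba(...)[:, 1]` call would be applied to the test features in place of `X_val`.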

5 Nov 2021, 10:44

Upvotes 0

MahadAhmed700

National university of science and technology

That's exactly what solved my problem. Thank you.

replied to tobi_ace · 5 Nov 2021, 13:39

Upvotes 0
