💳 Data Talk: The magic behind 0.8+ scores

African Credit Scoring Challenge

Helping Africa

$5 000 USD

Completed (over 1 year ago)

Skills you will learn

2006 joined

1020 active

Info Data Chat Leaderboard

Start

Nov 29, 24

Jan 12, 25

Reveal

Jan 13, 25

VincentSchuler

The magic behind 0.8+ scores

Data · 13 Jan 2025, 05:50 · 30

Hi,

I'm sharing here the magic trick I found to achieve 0.8+ public score and 0.78 private.

To gain a considerable boost, you just need to set all loans to 1 if they are : 1. Ghanain (or better, if loan_type=3) 2. The last loan of the customer_id.

This method consistantly gave me a boost between +0.07 and +0.10.

You can try it on your side and report here your scores 😉

Discussion 30 answers

MakalaMabotja

Can I ask you why you choose to set it to 1?

Also do you mind having a private chat about your approach. I struggled to get beyong 0.66 on the LB dispite my cv scores being 0.91

13 Jan 2025, 06:09

Upvotes 0

VincentSchuler

I set it to 1 because last loans are often the ones with default.

To achieve better "normal" scores, I recommend great feature selection and a good CV strategy (groupkfold, grouping by customer_id).

With this, I achieved CV 0.91 and LB 0.70 (0.818 with the magic trick).

replied to MakalaMabotja13 Jan 2025, 06:16

Upvotes 4

sys_ts__

Nice magic trick ...

replied to VincentSchuler13 Jan 2025, 06:32

Upvotes 1

maelfosso

In all the training I did, I always reached a CV of more than 0.93 I can't pass a score of 0.64.

That over fitting is something I don't understand.

Any help please?

replied to VincentSchuler13 Jan 2025, 08:01

Upvotes 0

VincentSchuler

It's hard to say without more information. Make sure you have a great CV strategy (groupkfold by customer_id) and no leak in your feature.

replied to maelfosso13 Jan 2025, 20:08

Upvotes 0

Jean12899

what method did you use for feature selection?

replied to VincentSchuler14 Jan 2025, 09:12

Upvotes 0

joju

What was Ure score before applying this post-processing, I mean the original model score

13 Jan 2025, 06:23

Upvotes 0

UjunwaFatima

Hi, congratulations! Just curious, what features did you engineer?

13 Jan 2025, 07:19

Upvotes 2

maelfosso

Honestly I don't understand the trick.

Any help please

13 Jan 2025, 08:02

Upvotes 1

enigmatic

Wow that's a very thoughtful trick!

13 Jan 2025, 09:08

Upvotes 2

CodeJoe

This is very insightful. It actually works. 71.9 private and 75.1 public score when I tried it. I would like to know how you came up with this idea. Did you get insights from the data that drove you to take such an initiative?

13 Jan 2025, 14:15

Upvotes 0

AI_Maven

University of Benin

Based on the conditions stated above, did you update the target column or you created a new column?

replied to CodeJoe13 Jan 2025, 17:10

Upvotes 0

VincentSchuler

I had the intuition that the Ghanean distribution was weird when I saw the high scores on LB. So I tried a few things (I started with just the "last loan" trick, which is already very good on its own), and it worked !

replied to CodeJoe13 Jan 2025, 20:06

Upvotes 1

CodeJoe

I updated the columns after conducting the predictions and indeed worked like magic😅.

replied to VincentSchuler13 Jan 2025, 20:32

Upvotes 0

CodeJoe

I really appreciate you coming up with your solution. Big ups and congratulations once again.

replied to VincentSchuler13 Jan 2025, 20:33

Upvotes 0

AI_Maven

University of Benin

@CodeJoe can you please explain what and how you implemented it?

I'm finding it difficult to understand the idea here, is it that a new feature was created based on the conditions stated, or was it done after predictions?

replied to CodeJoe13 Jan 2025, 20:54

Upvotes 0

CodeJoe

Okay, first you predict on the test set. After that, you then update the predictions column with the conditions stated.

replied to AI_Maven13 Jan 2025, 21:01

Upvotes 1

AI_Maven

University of Benin

Bro! I really don't understand what you saw in the data or what you were thinking to come up with this trick of yours. Kudos though, the trick worked!

replied to VincentSchuler14 Jan 2025, 17:59

Upvotes 0

_Alex

Thanks for sharing. How did you come up with setting loans with Ghanain or loan_type==3 to target 1?

Especially for setting loans with loan_type==3 to target 1, it's quite weird but magic!

13 Jan 2025, 14:30

Upvotes 1

CodeJoe

Indeed!

replied to _Alex13 Jan 2025, 14:32

Upvotes 0

VincentSchuler

replied to _Alex13 Jan 2025, 20:06

Upvotes 1

_Alex

Nice intuition!

replied to VincentSchuler14 Jan 2025, 06:00

Upvotes 0

Man_bassa

Thanks for sharing. should we have both the two conditions at the same time to set it to 1 ? What about kenya ?

13 Jan 2025, 14:48

Upvotes 0

VincentSchuler

It doesn't work with Kenya !

replied to Man_bassa13 Jan 2025, 20:04

Upvotes 1

e-hokage777

When you say last loan, due you mean last loan a customer took or the loan that has the latest due date ?

13 Jan 2025, 15:29

Upvotes 0

VincentSchuler

1st option !

replied to e-hokage77713 Jan 2025, 20:04

Upvotes 1

e-hokage777

ok, thanks

replied to VincentSchuler14 Jan 2025, 08:23

Upvotes 0

Helkias

University of Abomey-Calavi

@VincentSchuler Thank you so much for taking the time to share your insights !

14 Jan 2025, 00:18

Upvotes 0

Vi_Jays

Simply magic! Thanks @VincentSchuler 😁

14 Jan 2025, 05:41

Upvotes 0

robson_dsp

Man! I'm feeling really dumb right now because I don't understand.

When you say "the Ghanean distribution was weird when I saw the high scores on LB".

Distribution of what? There is no Ghana in the training set to check the amount of 0s and 1s of the target. The test set on the other hand does but there is no target.

I wouldn't have thought of manually altering the model's predictions based on a pattern or rule that the model itself hasn't even "seen", not in a million years. I didn't understand what you saw or what you were thinking. But congratulations! Good job!

14 Jan 2025, 17:09

Upvotes 0

Join the largest network for
data scientists and AI builders

About FAQs

Status