Primary competition visual

Adbot Ad Engagement Forecasting Challenge

Helping South Africa
$500 USD
Completed (almost 2 years ago)
Forecast
452 joined
113 active
Starti
Apr 04, 24
Closei
May 19, 24
Reveali
May 19, 24
How to break the 110 barrier?
Help Ā· 25 Apr 2024, 04:06 Ā· 5

I've noticed that there are several identical scores "112.4561452" on the list. I'm wondering if this is due to the data. I've been trying for a while now, but I'm still around 110 and haven't seen much improvement.

Discussion 5 answers
User avatar
ahmedo42

112.45 is what you get if your submission is only zeros clicks per test sample

25 Apr 2024, 06:51
Upvotes 1
User avatar
Jaw22
Zindi africa

Focus, just like in real life, look out for shade(hidden anomolies and gremlins)!!!

25 Apr 2024, 20:10
Upvotes 0
User avatar
yanteixeira

Whenever I feel that I hit the wall, I start over from scratch.

26 Apr 2024, 21:37
Upvotes 1
User avatar
soulayman31
Ecole Nationale SupƩrieure d'Informatique, Algiers

Maybe you are not considering the ID on train and test.

Separate the date from the id in the sample, and consider it by applying the same label encoder on train and test and train an LGBM you will easily reach 75.

3 May 2024, 11:22
Upvotes 0
User avatar
Jaw22
Zindi africa

A cautionary note...When using ID that appear in both the train and test (which is actually the ID in the sample submission file as there are no test set, unless of cause you split the train set and developed a test set), are u not guilty of committing a special form of 'LEAKAGE', in particular I believe it constitute a case of train-test contamination as data snooping occur through the use of ID variable, so if the same 'ID' that is in the train set is also in the test set(sample submission)...a cautionary note!!! So you will climb up the leader board only to be penalised later for leakage and overfitting....just sharing a thought!!!