Our solution is just a single XGBoost model with some feature engineering and a custom loss function (Huber loss). This was our best RMSE in CV (470); it was worse on the public LB (423) but best on private (106).
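For anyone curious what a custom Huber objective for XGBoost looks like: here's a minimal sketch using the smooth pseudo-Huber variant (XGBoost needs both a gradient and a hessian, and the pseudo-Huber form gives a nonzero hessian everywhere). The `delta` value and the training parameters are placeholders, not the settings we actually used.

```python
import numpy as np

def pseudo_huber_obj(delta=1.0):
    """Return an XGBoost-style custom objective for the pseudo-Huber loss.

    Loss: delta^2 * (sqrt(1 + (r/delta)^2) - 1), with r = pred - label.
    XGBoost expects the callable to return (gradient, hessian) per sample.
    delta=1.0 is an illustrative default, not a tuned value.
    """
    def obj(preds, dtrain):
        r = preds - dtrain.get_label()          # residuals
        scale = 1.0 + (r / delta) ** 2
        grad = r / np.sqrt(scale)               # d(loss)/d(pred)
        hess = 1.0 / scale ** 1.5               # d2(loss)/d(pred)^2
        return grad, hess
    return obj

# Usage sketch (assumes xgboost is installed; params are placeholders):
# import xgboost as xgb
# dtrain = xgb.DMatrix(X, label=y)
# booster = xgb.train({"max_depth": 6, "eta": 0.05}, dtrain,
#                     num_boost_round=500, obj=pseudo_huber_obj(delta=1.0))
```

For small residuals this behaves like squared error, and for large ones it grows roughly linearly, which is why it's popular when the target has heavy-tailed outliers.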
Totally agree - it's not always worth focusing on the public LB. CV never lies. My best-scoring single model (XGB) also achieved 100+ on private but did very poorly on public (357).
Yeah, got duped by the public LB scores. My earlier work was far simpler, but scored 300s. So I preferred my later work that scored sub 200.
I don't entirely understand how. If you look at the top ten, you see private scores of 100 with public scores varying from 100 to 500. That implies the 400+ public-score solutions did great on the private 80% of the test set but poorly on the public 20%, while the <150 public-score solutions did great on 100% of it?
I totally agree. Trust your CV. I must say I was surprised at the private leaderboard score after having a CV score of 435.