Sendy Logistics Challenge
$7,000 USD
Predict the estimated time of arrival (ETA) for motorbike deliveries in Nairobi
1176 data scientists enrolled, 431 on the leaderboard
23 August 2019—26 November 2019
cv score vs leaderboard
published 11 Sep 2019, 09:21

Is there a way to find an equivalence between leaderboard & cv ? Leaderboard is giving 475 , CV is giving 500 000 even on smaller folds than the test's length. how is that possible? I'm using mean_squared_error and r² as metrics.

"The error metric for this competition is the Root Mean Squared Error", so may be you should calc root from 500000?

Yeah forgot to add the sqrt lol. Thanks a lot!

still getting 20 seconds of difference between lb/cv scores. I just wanna make sure, is it normal in case train/test are a bit different, or do i need to make a better split to have coherent cv/lb scores?

It depends. Are you using a linear model or a tree model?

Hey DrFad, i'm using a tree model. I gave up on trying to create a perfect split to match what's the in test set. I'm trusting my cv's direction now. ( Also strangely some features disturb the difference even more )

Wise choice in trusting your cv score.

Followed the advice of the more experimented data scientists ;)