Sendy Logistics Challenge

Helping Kenya
$7 000 USD
Completed (over 6 years ago)
Prediction
2180 joined
448 active
Start
Aug 23, 19
Close
Nov 25, 19
Reveal
Nov 26, 19
How is Order ID generated?
Data · 16 Nov 2019, 08:16 · 2

Is the Order ID autogenerated, or was it assigned to the data just for the sake of this competition?

Discussion 2 answers

Could it have been autogenerated? I agree it looks a bit strange. The average age in the test set is higher than in the training set, while the mean Order ID is about the same. I experienced heavy information leakage, even with a pipeline and CV. If the test set comes after the training set, I should probably have used calendar folding, so that each validation fold is later in time than its training data. But it's too late now...
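For anyone wondering what calendar folding could look like, here is a minimal sketch, not the method anyone in this thread actually used. It assumes you have some numeric key per row that increases with time (a timestamp, or the numeric part of the Order ID if it really is auto-incremented). The function name `time_ordered_folds` and the sample values are hypothetical.

```python
from typing import Iterator, List, Sequence, Tuple


def time_ordered_folds(
    keys: Sequence[float], n_splits: int = 3
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_idx, valid_idx) pairs with an expanding training window.

    Rows are ordered by `keys` (assumed to increase with time), so every
    validation row comes after every training row in the same fold.
    """
    if n_splits < 1:
        raise ValueError("n_splits must be at least 1")
    # Row indices sorted from earliest to latest key.
    order = sorted(range(len(keys)), key=lambda i: keys[i])
    fold_size = len(order) // (n_splits + 1)
    if fold_size == 0:
        raise ValueError("not enough rows for the requested number of splits")
    for k in range(1, n_splits + 1):
        train_end = k * fold_size
        # The last fold absorbs any leftover rows.
        valid_end = len(order) if k == n_splits else train_end + fold_size
        yield order[:train_end], order[train_end:valid_end]


# Hypothetical time keys, given in arbitrary row order.
order_numbers = [40, 10, 30, 20, 60, 50, 80, 70]
folds = list(time_ordered_folds(order_numbers, n_splits=3))
for train_idx, valid_idx in folds:
    print("train rows:", train_idx, "validation rows:", valid_idx)
```

Unlike shuffled K-fold, no fold here trains on rows that are later than the rows it validates on, which is the kind of leakage a time-ordered test set would punish.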

26 Nov 2019, 08:15
Upvotes 0

Yes, it could be autogenerated, in the sense of an auto-increment column in the database. If that were the case, you could extract some date-like features from it, since a higher ID would mean a later order. However, I realized it might have been assigned just for the sake of the competition.
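As a rough illustration of the auto-increment idea, the sketch below pulls the trailing integer out of an ID string and turns it into an ordering feature. The ID format `Order_No_<number>`, the helper name `order_number`, and the sample IDs are assumptions for illustration. The feature is only meaningful if the IDs really were issued sequentially.

```python
import re
from typing import Optional


def order_number(order_id: str) -> Optional[int]:
    """Return the trailing integer of an ID string, or None if there is none."""
    match = re.search(r"(\d+)$", order_id.strip())
    return int(match.group(1)) if match else None


# Hypothetical IDs; the real format may differ.
ids = ["Order_No_4211", "Order_No_25375", "Order_No_1899"]
numbers = [order_number(i) for i in ids]

# Rank 0 is the presumed earliest order, under the auto-increment assumption.
rank_of = {n: r for r, n in enumerate(sorted(numbers))}
ranks = [rank_of[n] for n in numbers]
print(numbers, ranks)
```

If you also have a real timestamp column, checking that the rank moves together with it is a quick way to test whether the ID carries any time information at all.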

Also, if the order ID is anything to go by, then you can see that the training and test sets were randomly sampled. In fact, they were deliberately sampled to contain nearly the same number of outliers.
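One simple way to check the random-sampling claim, sketched under the assumption that the numeric part of the Order ID is available for both sets: measure what share of test IDs lies above the training median. A share near 0.5 suggests the two sets are interleaved, while a share near 1.0 suggests the test set comes after the training set. The function name and the toy numbers are made up for illustration.

```python
from statistics import median
from typing import Sequence


def share_above_train_median(
    train_ids: Sequence[float], test_ids: Sequence[float]
) -> float:
    """Fraction of test IDs that are larger than the median training ID."""
    cut = median(train_ids)
    return sum(t > cut for t in test_ids) / len(test_ids)


# Toy example 1: test IDs interleaved with training IDs (random-looking split).
train_random = [1, 3, 4, 6, 7, 9, 10, 12]
test_random = [2, 5, 8, 11]

# Toy example 2: all test IDs come after the training IDs (time-based split).
train_temporal = [1, 2, 3, 4, 5, 6, 7, 8]
test_temporal = [9, 10, 11, 12]

random_share = share_above_train_median(train_random, test_random)
temporal_share = share_above_train_median(train_temporal, test_temporal)
print(random_share, temporal_share)
```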