
Urban Air Pollution Challenge by #ZindiWeekendz

Helping Africa
$300 USD
Challenge completed over 5 years ago
Prediction
236 joined
134 active
Start: Apr 10, 20
Close: Apr 12, 20
Reveal: Apr 12, 20
CV vs LB
Data · 12 Apr 2020, 14:42 · 6

Mine:

CV: 26.19, LB: 31.11

Discussion 6 answers

CV: 26, LB: 29

12 Apr 2020, 15:01
Upvotes 0
Mugisha_

CV: 30.26 LB: 32.35

12 Apr 2020, 15:14
Upvotes 0
Expensya

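You may have data from the same place in both your training and validation splits, which would make the CV score look better than the LB score.

Try splitting by Place_ID instead: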

from sklearn.cluster import KMeans

# x = number of places to hold out for validation; choose a value before running.
places = train['Place_ID'].unique()

# Split by place so that no Place_ID appears in both sets.
tr = train[train['Place_ID'].isin(places[x:])]
te = train[train['Place_ID'].isin(places[:x])]

# Drop identifiers and every target-derived column from the features.
drop_cols = ['Place_ID X Date', 'Date', 'Place_ID', 'target', 'target_min',
             'target_max', 'target_variance', 'target_count', 'target_diff']

X_train, y_train = tr.drop(columns=drop_cols), tr['target']
X_test, y_test = te.drop(columns=drop_cols), te['target']
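
The same idea can be turned into a full cross-validation with scikit-learn's GroupKFold, which keeps all rows of one Place_ID in the same fold. This is only a minimal sketch: the frame below is synthetic stand-in data (the column "feature" and all values are made up for illustration), and the choice of 5 folds is an assumption, not something from the competition.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import GroupKFold

# Synthetic stand-in for the competition's train frame (values are made up).
rng = np.random.default_rng(0)
n_places, n_days = 20, 10
train = pd.DataFrame({
    "Place_ID": np.repeat([f"P{i:02d}" for i in range(n_places)], n_days),
    "feature": rng.normal(size=n_places * n_days),   # hypothetical feature
    "target": rng.normal(loc=60, scale=20, size=n_places * n_days),
})

# Group-wise folds: every row of a given Place_ID lands in exactly one fold.
gkf = GroupKFold(n_splits=5)  # 5 folds is an arbitrary choice here
fold_overlaps = []
val_sizes = []
for fold, (tr_idx, va_idx) in enumerate(
    gkf.split(train, train["target"], groups=train["Place_ID"])
):
    tr_places = set(train.iloc[tr_idx]["Place_ID"])
    va_places = set(train.iloc[va_idx]["Place_ID"])
    # Number of places shared by the training and validation parts of this fold.
    fold_overlaps.append(len(tr_places & va_places))
    val_sizes.append(len(va_idx))
    print(f"fold {fold}: {len(va_places)} validation places, "
          f"{fold_overlaps[-1]} shared with training")
```

Fitting a model inside the loop on train.iloc[tr_idx] and scoring it on train.iloc[va_idx] gives a CV score in which no place is seen during both training and validation.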

12 Apr 2020, 15:28
Upvotes 0

Before giving your CV score, please say what your validation strategy is, so that everyone can compare.

12 Apr 2020, 16:12
Upvotes 0