Zimnat Insurance Recommendation Challenge

Helping Zimbabwe
$5 000 USD
Completed (over 5 years ago)
Prediction
Collaborative Filtering
1777 joined
612 active
Start
Jul 01, 20
Close
Sep 13, 20
Reveal
Sep 13, 20
Is it normal to use test data in training?
Help · 7 Sep 2020, 09:13 · edited 4 minutes later · 7

Hi,

1) Is it normal to use the test rows, with their known ones, in the training process? Those rows are already known and would add some information to training.

2) How do you handle values in test that are missing from train, and vice versa?

Discussion 7 answers

I used rare-label encoding followed by weight of evidence encoding, which gave me 0.08. Rare-label encoding with mean (average) encoding gives the same score.
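For readers unfamiliar with the two encodings mentioned here, a minimal sketch in pandas follows. It is not the poster's code: the column name `occupation_code`, the `min_count` threshold, the `RARE` label and the additive smoothing are all assumptions chosen for illustration.

```python
import numpy as np
import pandas as pd

def rare_encode(train_col, test_col, min_count=2, rare_label="RARE"):
    """Replace categories seen fewer than min_count times in train with rare_label.

    Categories that appear only in test are also mapped to rare_label.
    """
    counts = train_col.value_counts()
    frequent = set(counts[counts >= min_count].index)

    def encode(col):
        return col.where(col.isin(frequent), rare_label)

    return encode(train_col), encode(test_col)

def woe_encode(train_col, target, test_col, smoothing=0.5):
    """Weight of evidence per category: ln(P(cat | y=1) / P(cat | y=0)).

    The smoothing term (an assumption) avoids log(0) for one-sided categories.
    """
    stats = (
        pd.DataFrame({"cat": train_col, "y": target})
        .groupby("cat")["y"]
        .agg(["sum", "count"])
    )
    events = stats["sum"] + smoothing
    non_events = stats["count"] - stats["sum"] + smoothing
    woe = np.log((events / events.sum()) / (non_events / non_events.sum()))
    # Categories without a fitted value get 0.0, i.e. neutral evidence.
    return train_col.map(woe), test_col.map(woe).fillna(0.0)

# Toy data; the values are made up.
train = pd.DataFrame({"occupation_code": ["A", "A", "A", "B", "B", "C"],
                      "y": [1, 0, 1, 0, 0, 1]})
test = pd.DataFrame({"occupation_code": ["A", "B", "D"]})

train_rare, test_rare = rare_encode(train["occupation_code"], test["occupation_code"])
train_woe, test_woe = woe_encode(train_rare, train["y"], test_rare)
```

Both encoders are fitted on train only, so the unseen test code `D` falls into the same `RARE` bucket as the infrequent train code `C`.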

7 Sep 2020, 09:14
Upvotes 0

If you use test data for training as well, your models are going to overfit and not generalize well.

There are various ways of dealing with missing values, such as imputing with the mean or mode.
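As a rough illustration of mean and mode imputation, here is a small pandas sketch. The column names `age` and `marital_status` and the helper `fit_imputer` are hypothetical. The fill values are computed from train only and then reused for test.

```python
import pandas as pd

def fit_imputer(train, numeric_cols, categorical_cols):
    """Compute fill values from train: mean for numeric, mode for categorical."""
    fill = {c: train[c].mean() for c in numeric_cols}
    fill.update({c: train[c].mode().iloc[0] for c in categorical_cols})
    return fill

# Toy data; the values are made up.
train = pd.DataFrame({"age": [30.0, None, 50.0],
                      "marital_status": ["M", "M", None]})
test = pd.DataFrame({"age": [None, 40.0],
                     "marital_status": [None, "S"]})

fill_values = fit_imputer(train, ["age"], ["marital_status"])

# The same train-derived values fill the gaps in both frames.
train_filled = train.fillna(fill_values)
test_filled = test.fillna(fill_values)
```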

7 Sep 2020, 09:17
Upvotes 0

I think there are no missing values, just one for the date.

Some occupation codes from test are missing from train, and vice versa.
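One way to check for this and neutralise it is sketched below. The column name `occupation_code`, the toy codes and the `UNKNOWN` label are assumptions. Note that it only compares feature values across the two files and does not touch any labels.

```python
import pandas as pd

# Toy data; the codes are made up.
train = pd.DataFrame({"occupation_code": ["A1", "B2", "B2", "C3"]})
test = pd.DataFrame({"occupation_code": ["B2", "D4", "A1"]})

train_codes = set(train["occupation_code"].dropna())
test_codes = set(test["occupation_code"].dropna())

# Codes that appear on one side only.
only_in_test = test_codes - train_codes
only_in_train = train_codes - test_codes

# Collapse every code that is not shared by both sets into one bucket,
# so the model never sees a category it could not learn or apply.
shared = train_codes & test_codes
for df in (train, test):
    df["occupation_code_clean"] = df["occupation_code"].where(
        df["occupation_code"].isin(shared), "UNKNOWN"
    )
```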

I used 3 validation, target mean encoding, and some feature engineering and got good results, though with a lot of missing values in test after target encoding.
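Missing values after target encoding typically show up when a test category never occurs in train. Below is a minimal sketch of out-of-fold target mean encoding with a global-mean fallback. It is not the poster's pipeline: the 3 folds, the column name `occupation_code` and the fallback choice are assumptions.

```python
import numpy as np
import pandas as pd

def oof_target_mean_encode(train_col, target, test_col, n_splits=3, seed=0):
    """Encode a categorical column by the target mean.

    Train rows are encoded out of fold (using the other folds only);
    test rows use means from all of train. Unseen categories fall back
    to the global target mean.
    """
    global_mean = target.mean()
    rng = np.random.default_rng(seed)
    fold_ids = rng.permutation(len(train_col)) % n_splits

    train_encoded = pd.Series(np.nan, index=train_col.index, dtype=float)
    for fold in range(n_splits):
        in_fold = fold_ids == fold
        # Category means computed without the rows being encoded.
        means = target[~in_fold].groupby(train_col[~in_fold]).mean()
        train_encoded.loc[in_fold] = train_col[in_fold].map(means).to_numpy()

    test_means = target.groupby(train_col).mean()
    test_encoded = test_col.map(test_means)
    return train_encoded.fillna(global_mean), test_encoded.fillna(global_mean)

# Toy data; the values are made up.
train = pd.DataFrame({"occupation_code": ["A", "A", "A", "B", "B", "B"],
                      "y": [1, 1, 0, 0, 0, 1]})
test = pd.DataFrame({"occupation_code": ["A", "B", "Z"]})

train_enc, test_enc = oof_target_mean_encode(
    train["occupation_code"], train["y"], test["occupation_code"]
)
```

Here the unseen test code `Z` receives the global mean of 0.5 instead of a missing value.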

You mean from the test data? I hardly observed this.

But in that case, how do you treat the other policy values in the test set? Out of the 21, you know only 1 that is correct; what about the rest?

7 Sep 2020, 09:17
Upvotes 0