Primary competition visual

Zimnat Insurance Recommendation Challenge

Helping Zimbabwe
$5 000 USD
Completed (over 5 years ago)
Prediction
Collaborative Filtering
1777 joined
612 active
Starti
Jul 01, 20
Closei
Sep 13, 20
Reveali
Sep 13, 20
User avatar
tichavona
Chinhoyi university of technology
Disparity in occupation codes
Connect · 29 Aug 2020, 20:37 · 2

Good day people,

I would like to pick your beautiful brains on the treatment of the disparity on occupation codes. It appears there are 233 unique values on train dataset and only 187 in test. I have also gone further to make enumerations which revealed 9 appear in test but not in train and 55 in train dataset but not in test. This means we have a total of 64 unique instances not found in both datasets.

Discussion 2 answers

is this is with other feature too because

30 Aug 2020, 10:25
Upvotes 0
User avatar
tichavona
Chinhoyi university of technology

It seems to be only perculiar to occupation codes. With sex it was an issue of case difference which I think is neglible in terms of effect.