💰 Join the Buzz: Is it normal to use test... - 362 Views

Compete Jobs Learn Chat Leaderboard

More

For Business Partners Meet the team Press Case studies AI4EAC

Primary competition visual

Zimnat Insurance Recommendation Challenge

Helping Zimbabwe

$5 000 USD

Completed (almost 6 years ago)

Skills you will learn

Prediction

Collaborative Filtering

1784 joined

612 active

Info Data Chat Leaderboard

Start

Jul 01, 20

Close

Sep 13, 20

Reveal

Sep 13, 20

Is it normal to use test data in train

Help · 7 Sep 2020, 09:13 · edited 4 minutes later · 7

Hi,

1) it normal to use test data with ones in train process, assuming that rows are already known, and will add some info in training process.

2) how handle values test values that miss from train and vice versa.

Discussion 7 answers

I have used Rare encoding then Wait of evidence encoding, gave me 0.08. the Rare encoding with mean aberage incoding gives the same.

7 Sep 2020, 09:14

Upvotes 0

If you use test data for training aswell your models are going to overfit and not generalize well.

They are various ways of dealing with missing values such as imputing by the mean or mode.

7 Sep 2020, 09:17

Upvotes 0

i think there is no missing values just one for date

replied to darrel7 Sep 2020, 09:18

Upvotes 0

Some occupation codes from test are missing from train and vice versia

replied to anishjain7 Sep 2020, 09:18 (edited 17 minutes later)

Upvotes 0

I used 3 validation , target mean encoding and some feature enginering and get good results, with a lot of missing values in test after target encoding.

replied to darrel7 Sep 2020, 09:22

Upvotes 0

You mean from the test data. I hardly observed this

replied to garikhgh7 Sep 2020, 13:17

Upvotes 0

but in that case how you consider other policy values in test set out of 21 you know only 1 correct what about the rest

7 Sep 2020, 09:17

Upvotes 0

Join the largest network for
data scientists and AI builders

Privacy Policy Terms of Use Rules

© Zindi 2026