DSN Pre-Bootcamp Hackathon: Expresso Churn Prediction Challenge by Data Science Nigeria

Helping Nigeria
Knowledge
Completed (over 5 years ago)
Classification
Prediction
671 joined
358 active
Start: Aug 08, 20
Close: Aug 22, 20
Reveal: Aug 22, 20
Training columns
Data · 13 Aug 2020, 12:01 · 2

Some columns in the training set contain NaN values. Should I drop those columns, or use SimpleImputer to fill in the missing values before training? Please, I need a detailed explanation. Thank you.

Discussion 2 answers
University of Lagos

You can certainly drop the columns, but you may lose information. Alternatively, impute with SimpleImputer, but explore the data first to work out which imputation strategy fits; otherwise you could be adding noise. You could also impute manually, or fill the missing values with an arbitrary value such as -99999: train.fillna(-99999, inplace=True).
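To make the three options concrete, here is a minimal sketch on a toy DataFrame. The column names, the 0.5 drop threshold, and the median strategy are all assumptions chosen for illustration, not values taken from the competition data, so adjust them after exploring the real training set.

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

# Toy stand-in for the training set; the column names are hypothetical.
train = pd.DataFrame({
    "col_a": [1.0, np.nan, 3.0, 4.0],
    "col_b": [np.nan, np.nan, np.nan, 8.0],
    "col_c": [10.0, 20.0, 30.0, 40.0],
})

# Explore first: share of missing values in each column.
missing_share = train.isna().mean()

# Option 1: drop columns that are mostly missing.
# The 0.5 cutoff is an arbitrary assumption, not a rule.
dropped = train.loc[:, missing_share <= 0.5]

# Option 2: impute with SimpleImputer (median chosen only as an example).
imputer = SimpleImputer(strategy="median")
imputed = pd.DataFrame(
    imputer.fit_transform(train),
    columns=train.columns,
    index=train.index,
)

# Option 3: fill with an arbitrary sentinel value, as suggested above.
# This returns a new frame and leaves `train` untouched.
filled = train.fillna(-99999)
```

If you go with SimpleImputer, fit it on the training set only and reuse the fitted object with imputer.transform(...) on the test set, so both sets are filled with the same values. The sentinel approach tends to suit tree-based models better than linear ones, since a value like -99999 would distort a linear fit.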

13 Aug 2020, 12:08
Upvotes 0
University of Lagos

I want to know if it is proper to drop the "TENURE" column. Please, which columns need to be dropped?