☎️ AI in Focus: Training columns

DSN Pre-Bootcamp Hackathon: Expresso Churn Prediction Challenge by Data Science Nigeria

Helping Nigeria

Knowledge

Completed (almost 6 years ago)

Skills you will learn

Classification

Prediction

671 joined

358 active

Info Data Chat Leaderboard

Start

Aug 08, 20

Aug 22, 20

Reveal

Aug 22, 20

Bambillo

Training columns

Data · 13 Aug 2020, 12:01 · 2

Some columns in the training set contain some 'nan's. Should I drop those columns or use SimpleImputer to make them numeric before training? Please I need your detailed explanation. Thank you.

Discussion 2 answers

kolatimiDave

University of lagos

You can drop the columns for sure but there may be information loss, or impute with simple imputer when you've explored the data and then you find which method to impute, else you could be adding noise, or maybe impute manually. You can also filllna with arbitrary value like -99999, train.fillna(-99999, inplace=True).

13 Aug 2020, 12:08

Upvotes 0

ucheazunna

University of Lagos

I want to know if it proper to drop the "TENURE' column. Please which colum require to be dropped?

replied to kolatimiDave13 Aug 2020, 23:20

Upvotes 0

Join the largest network for
data scientists and AI builders

About FAQs

Status