Primary competition visual

data.org Financial Health Prediction Challenge

Helping Eswatini, Lesotho
and 2 other countries
  • Eswatini
  • Lesotho
  • Zimbabwe
  • Malawi
  • Scroll to see more
$1 500 USD
~1 month left
Prediction
Machine Learning
832 joined
355 active
Starti
Dec 12, 25
Closei
Feb 28, 26
Reveali
Mar 01, 26
DATA needs preprocessing, I share my approach,hope it helps
Data · 23 Jan 2026, 21:30 · 0

I standardized and cleaned the raw train/test tables by normalizing categorical values (unifying “N/A / don’t know / doesn’t apply” variants), explicitly encoding missingness (missing-category tokens and numeric missing flags), and applying consistent numeric handling (log/ratio features plus robust centering where appropriate). I then verified train–test schema alignment, audited missingness and distribution shift (including country-level checks), and generated stable cross-validation folds that respect the class imbalance. Finally, I froze a reproducible feature set and produced “model-ready” datasets with consistent columns for training, evaluation, and submission generation.

Discussion 0 answers