South Africa is divided into over 4,000 wards. We have aggregated the target indicator and the other predictive variables from the 2011 census across all the households within each ward to create an aggregated value of each indicator per ward.
The target variable of interest is the percentage of households per ward that are both female-headed and earn an annual income that is below R19,600 (approximately $2,300 USD in 2011). For context, the poverty line in South Africa is defined as R1,183 per month per person and the average individual salary in South Africa is R20,860 per month.
The objective of this challenge is to accurately model the target indicator using the predictive variables provided in the datasets. For the purposes of this competition, we have split the wards into a train and test set. You will train your model on the 2,822 wards in the train set and apply your model to the 1,013 wards in the test set.
Take note that the variables:
Join the largest network for
data scientists and AI builders