South Africa is divided into 4,392 wards. We will aggregate the target indicators (households with 5+ members and no on-premises water) and the other predictive variables from the census across all households within each ward, giving one aggregated value per indicator per ward. Some wards are used for training; the remainder form the test set. You may not use external data for this competition.
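The ward-level aggregation described above can be illustrated with a minimal pandas sketch. The household table, its column names (`ward`, `household_size`, `water_on_premises`), and the choice to aggregate as a share of households are all assumptions made for illustration; the competition files are already aggregated and may use a different definition.

```python
import pandas as pd

# Hypothetical household-level records (not part of the competition data).
households = pd.DataFrame({
    "ward": ["W1", "W1", "W1", "W2", "W2"],
    "household_size": [6, 3, 5, 7, 2],
    "water_on_premises": [False, False, True, False, True],
})

# Flag households with 5+ members and no on-premises water.
households["flag"] = (households["household_size"] >= 5) & (
    ~households["water_on_premises"]
)

# Aggregate to one row per ward: household count and share of flagged households.
# Using the mean (a share) is an assumption; a count would use "sum" instead.
ward_level = (
    households.groupby("ward")
    .agg(total_households=("flag", "size"), target=("flag", "mean"))
    .reset_index()
)

print(ward_level)
```

Each ward ends up as a single row, which is the shape the training and test sets are described as having.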
How to use Colab on Zindi
How to mount a drive on Colab
Test resembles Train.csv but without the target-related columns. This is the dataset to which you will apply your model.
This shows the submission format for this competition, with the ‘ID’ column mirroring that of Test.csv and the ‘target’ column containing your predictions. The order of the rows does not matter, but the ID values must match those in Test.csv exactly.
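A submission in this format could be assembled and checked roughly as follows. This is a sketch only: the helper names `make_submission` and `ids_match` are hypothetical, the IDs and predictions are made-up placeholders, and only the `ID` and `target` column names come from the description above.

```python
import pandas as pd

def make_submission(test_ids, predictions):
    """Build a two-column frame with the 'ID' and 'target' columns."""
    test_ids = list(test_ids)
    predictions = list(predictions)
    if len(test_ids) != len(predictions):
        raise ValueError("need exactly one prediction per test ID")
    return pd.DataFrame({"ID": test_ids, "target": predictions})

def ids_match(submission, test_ids):
    """Check that the IDs are the same set as in the test file; row order is ignored."""
    test_ids = list(test_ids)
    return (
        len(submission) == len(test_ids)
        and not submission["ID"].duplicated().any()
        and set(submission["ID"]) == set(test_ids)
    )

# Placeholder IDs and predictions, for illustration only.
example_ids = ["ward_a", "ward_b", "ward_c"]
submission = make_submission(example_ids, [0.12, 0.30, 0.05])

# Shuffling rows does not affect validity, since only the IDs must be correct.
shuffled = submission.sample(frac=1.0, random_state=0)
print(ids_match(shuffled, example_ids))

# Writing the file would then be: submission.to_csv("submission.csv", index=False)
```

Running the ID check before uploading catches misspelled or missing IDs early.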
Full list of variables and their explanations.
Train contains the target. This is the dataset that you will use to train your model.
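As a rough illustration of the train-then-predict workflow, the sketch below fits an ordinary least-squares baseline on the columns that Train and Test share. The tiny in-memory frames and the feature names `feat_a` and `feat_b` are invented stand-ins for the real files, and least squares is just one simple choice of model, not a recommended approach.

```python
import numpy as np
import pandas as pd

# Invented stand-ins for Train and Test; real column names will differ.
train = pd.DataFrame({
    "ID": ["A", "B", "C", "D"],
    "feat_a": [0.0, 1.0, 2.0, 3.0],
    "feat_b": [1.0, 0.0, 1.0, 0.0],
    "target": [1.0, 3.0, 5.0, 7.0],
})
test = pd.DataFrame({
    "ID": ["E", "F"],
    "feat_a": [4.0, 5.0],
    "feat_b": [1.0, 0.0],
})

# Use only columns present in both files, so target-related columns are excluded.
feature_cols = [c for c in train.columns if c in test.columns and c != "ID"]

def design(df):
    """Feature matrix with a leading column of ones for the intercept."""
    X = df[feature_cols].to_numpy(dtype=float)
    return np.column_stack([np.ones(len(df)), X])

# Fit by least squares on the training wards, then predict for the test wards.
coef, *_ = np.linalg.lstsq(design(train), train["target"].to_numpy(), rcond=None)
predictions = design(test) @ coef

print(dict(zip(test["ID"], predictions)))
```

Selecting features by intersecting column names keeps the model from accidentally training on anything that is absent at prediction time.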