Women's Mentorship #1: Hack for Safety by Agence Française de Développement
Can you predict which women are at highest risk of being made a victim of a crime in South Africa?
32 data scientists enrolled, 26 on the leaderboard
14 May—19 July
67 days

The training set contains ~7700 individuals, each described by a range of attributes, including whether they have experienced a crime in the last 5 years. The test set is similar to the training set and has ~3300 individuals.

This challenge calls on you to build a machine learning model that predicts a woman’s level of risk of being victimized by a crime given basic information about her and her life.

Files available for download:

  • Train.csv - contains the target. This is the dataset that you will use to train your model.
  • Test.csv - resembles Train.csv but without the target-related columns. This is the dataset to which you will apply your model.
  • SampleSubmission.csv - shows the submission format for this competition, with the ID column mirroring that of Test.csv and the ‘target’ column containing your predictions. The order of the rows does not matter, but the ID values must match those in Test.csv exactly.
  • VariableDefinition.csv - contains the variable definitions
  • StarterNotebook.ipynb - a starter notebook that will help you make your first submission to the leaderboard.
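
To make the submission format concrete, here is a minimal sketch of a constant baseline. It is not the method used in StarterNotebook.ipynb. It predicts the training-set share of positive cases for every test individual and writes an ID/target file. The column names `ID` and `target` come from the description above; that Train.csv stores the label in a column named `target` with 0/1 values is an assumption, so check VariableDefinition.csv before relying on it. The helper names (`base_rate`, `build_submission`, `write_submission`, `read_rows`) and the demo rows are invented for illustration.

```python
import csv
import io


def base_rate(train_rows, target_col="target"):
    """Share of training rows whose target equals 1 (assumes 0/1 labels)."""
    values = [float(row[target_col]) for row in train_rows]
    return sum(values) / len(values)


def build_submission(train_rows, test_rows, id_col="ID", target_col="target"):
    """Return one {ID, target} row per test individual, all set to the base rate."""
    rate = base_rate(train_rows, target_col)
    return [{id_col: row[id_col], target_col: rate} for row in test_rows]


def write_submission(rows, handle, id_col="ID", target_col="target"):
    """Write the rows as CSV with an 'ID,target' header."""
    writer = csv.DictWriter(handle, fieldnames=[id_col, target_col])
    writer.writeheader()
    writer.writerows(rows)


def read_rows(path):
    """Load a CSV file as a list of dicts keyed by column name."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


# Made-up rows standing in for Train.csv and Test.csv (not real competition data).
demo_train = [
    {"ID": "a", "age": "30", "target": "1"},
    {"ID": "b", "age": "41", "target": "0"},
    {"ID": "c", "age": "25", "target": "0"},
    {"ID": "d", "age": "52", "target": "1"},
]
demo_test = [{"ID": "x", "age": "33"}, {"ID": "y", "age": "47"}]

demo_submission = build_submission(demo_train, demo_test)
buffer = io.StringIO()
write_submission(demo_submission, buffer)

# With the real files (assuming the column names above are correct):
# rows = build_submission(read_rows("Train.csv"), read_rows("Test.csv"))
# with open("submission.csv", "w", newline="", encoding="utf-8") as out:
#     write_submission(rows, out)
```

A constant prediction like this only gives a reference score; any model built on the individual attributes should be compared against it.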