Primary competition visual

AI4EAC Finance Practice Challenge

Helping Burundi, Democratic Republic of Congo
and 6 other countries
  • Burundi
  • Democratic Republic of Congo
  • Kenya
  • Rwanda
  • Somalia
  • Uganda
  • Tanzania, United Republic of
  • South Sudan
  • Scroll to see more
1500 Zindi points
Under code review
Prediction
Classification
Feature Engineering
104 joined
44 active
Starti
Mar 06, 26
Closei
Mar 27, 26
Reveali
Mar 28, 26
About

You will work with loan and customer data from Kenya (train and test set) and Ghana (test set). This split emphasises the need for models that generalise well, i.e. perform well across different countries and financial contexts.

You will notice that a customer ID can appear in multiple rows of the dataset, and for a given customer, a loan ID can also appear with different lender IDs. For example, we can have the same customer ID on three rows with the same loan ID on each but a different lender ID. This observation means that the loan is funded by three separate lenders.

Additional data pulled from the Federal Reserve Economic Data (FRED) portal are provided to enrich model performance and offer insights into features that impact on credit scoring.

The economic indicators are:

  • FP.CPI.TOTL.ZG: Inflation, consumer prices (annual %)
  • PA.NUS.FCRF: Official exchange rate (LCU per US$, period average)
  • FR.INR.RINR: Real interest rate (%)
  • AG.LND.PRCP.MM: Average precipitation in depth (mm per year)
  • FR.INR.DPST: Deposit interest rate (%)
  • FP.INR.LEND: Lending interest rate (%)
  • FR.INR.LNDP: Interest rate spread (lending rate minus deposit rate, %)
  • EG.USE.COMM.FO.ZS: Fossil fuel energy consumption (% of total)
  • SL.UEM.TOTL.ZS: Unemployment rate

The data downloaded from FRED is saved in a separate CSV in the data section.

Files
Description
Files
External data pulled from from the Federal Reserve Bank on the countries of interest.
This is the dataset on which to train your machine learning model. You are welcome to enrich the dataset with economic data sourced from the Federal Reserve Bank on the countries of interest.
This file is an example of what your submission file should look like.
This is the dataset on which you need to apply your trained model for prediction.
A starter notebook showing how to load data and run a basic ML model.
A description of each variable contained in Train/Test.csv