Primary competition visual

African Credit Scoring Challenge

Helping Africa
$5 000 USD
Challenge completed 10 months ago
1889 joined
1022 active
Starti
Nov 29, 24
Closei
Jan 12, 25
Reveali
Jan 13, 25
About

You will work with loan and customer data from Kenya (train and test set) and Ghana (test set). This split emphasises the need for models that generalise well, i.e. perform well across different countries and financial contexts.

You will notice that a customer ID can appear in multiple rows of the dataset, and for a given customer, a loan ID can also appear with different lender IDs. For example, we can have the same customer ID on three rows with the same loan ID on each but a different lender ID. This observation means that the loan is funded by three separate lenders.

Additional data pulled from the Federal Reserve Economic Data (FRED) portal are provided to enrich model performance and offer insights into features that impact on credit scoring.

The economic indicators are:

  • FP.CPI.TOTL.ZG: Inflation, consumer prices (annual %)
  • PA.NUS.FCRF: Official exchange rate (LCU per US$, period average)
  • FR.INR.RINR: Real interest rate (%)
  • AG.LND.PRCP.MM: Average precipitation in depth (mm per year)
  • FR.INR.DPST: Deposit interest rate (%)
  • FP.INR.LEND: Lending interest rate (%)
  • FR.INR.LNDP: Interest rate spread (lending rate minus deposit rate, %)
  • EG.USE.COMM.FO.ZS: Fossil fuel energy consumption (% of total)
  • SL.UEM.TOTL.ZS: Unemployment rate

The data downloaded from FRED is saved in a separate CSV in the data section.

Files
Description
Files
This file is an example of what your submission file should look like.
This is the dataset on which you need to apply your trained model for prediction.
This is the dataset on which to train your machine learning model. You are welcome to enrich the dataset with economic data sourced from the Federal Reserve Bank on the countries of interest.
External data pulled from from the Federal Reserve Bank on the countries of interest.
A starter notebook showing how to load data and run a basic ML model.
A description of each variable contained in Train/Test.csv