Primary competition visual

Nedbank Transaction Volume Forecasting Challenge

Helping South Africa
R250 000 ZAR
28 days left
Feature Engineering
Time-series
Forecast
111 joined
32 active
Starti
Apr 03, 26
Closei
May 03, 26
Reveali
May 06, 26
About

The dataset contains anonymised behavioural data for 11,944 Nedbank customers. It spans up to 34 months of transaction history (December 2012 through October 2015), monthly financial snapshots, and cleaned demographic profiles. Your task is to predict next_3m_txn_count - the total number of bank transactions each customer will make in a future three-month window (November 2015 through January 2016). All historical data is strictly before the prediction window. No target leakage is present in the feature tables.

Data Notes

The prediction window (November through January) spans the South African holidayseason - consider seasonality effects.

TransactionAmount is signed: negative values indicate debits, positive values indicate credits.

This is real-world banking data. As with any production dataset, you should expect missing values, inconsistencies, and fields that require careful inspection. Thoughtful data exploration and cleaning will be rewarded.

Files
Description
Files
Complete data dictionary listing every column across all files, with data types and descriptions.
8,360 customers with the target variable next_3m_txn_count. One row per customer.
3,584 customer IDs for which you must predict next_3m_txn_count. The target column is withheld.
Shows the required submission format. Contains all Test customer IDs with placeholder values.
Local scoring script. Run python evaluate.py my_submission.csv to check your RMSLE score before submitting.
A short overview to help you get started.
Introductory notebook demonstrating how to load the data, join tables, engineer basic features, and train a baseline model.
18 million transaction records covering December 2012 to October 2015. Each row is a single bank transaction with amount, type, balance, and debit/credit indicator. This is the primary feature source. Join to Train/Test on UniqueID.
372,000 monthly financial snapshot rows covering December 2013 to October 2015. Contains net interest income and revenue by product type (Transactional, Investments, Mortgages). 567 customers have no financials records - use left joins. Join on UniqueID.
One row per customer (11,944 rows - full coverage of Train and Test). Contains static demographic attributes: age, gender, income, occupation, industry, marital status, city, and banking relationship type.