Primary competition visual

Barbados Traffic Analysis Challenge

Helping Barbados
$11 000 USD
~2 months left
Computer Vision
Prediction
Object Tracking
Video Analytics
Deep Learning
1478 joined
110 active
Starti
Nov 07, 25
Closei
Jan 26, 26
Reveali
Jan 26, 26
About

You are provided with video data from four cameras, with unique views of the four entrances and exits of the Norman Niles traffic roundabout. Each ~1 minute time segment is at the same period for all four cameras, and is labeled with a congestion classification [“free flowing”, “light delay”, “moderate delay”, “heavy delay”].

Your task is to develop models that extract and engineer features from the raw video data to predict the congestion level. These features should capture underlying traffic dynamics such as flow rate, number of vehicles, vehicle entry and exit timing, or other movement patterns that correlate with congestion. You are free to use any reproducible automated modeling or labeling techniques (excluding manual labelling). The test set contains only videos (no labels), and your model must infer congestion classes based solely on learned representations from the training phase.

You may augment or generate your own training data to increase the number of training samples. Ensure that your train creation process is reproducible and included in your submitted code.

Back propagation is NOT allowed during inference. Zindi is committed to providing solutions of value to the client. The implementation of this solution will be to ingest 15timestamps of video data and to predict the 18th to 23rd congestion_enter_rating and congestion_exit_rating.

This is the structure: Training data → Test input → 2-minute embargo (operational lag) → 5-minute test output.

Your solution should operate in real time, meaning:

  • Each minute must be predicted sequentially.
  • You cannot use data from minute N+1 to predict minute N.
  • You cannot use a training segment that follows a test period to inform or adjust your model during inference.

Back-propagation within a training loop (i.e. updating model weights during normal training) is, of course, allowed — just keep the real-time deployment context in mind.

This means you should not use future data to predict the pass as this would not be possible in the real world.

Anecdotal evidence indicates that in Barbados, many drivers do not use turn signals when entering or exiting roundabouts — a key behavioral factor that may increase congestion. The Ministry hopes that insights from this challenge will help design targeted interventions to improve traffic flow across the island.

The top 20 solutions on the private leaderboard must provide documentation on the top factors contributing to congestion ratings. After the challenge closes, we will request a document from the top 20 with a table that includes feature name, feature contribution, and notes.

You are welcome to use any modeling and labelling techniques, except manual labelling, to label the data. Suggestions include flow rate, number of cars entering and exiting the roundabout, use of indicator, vehicle type, vehicle occupancy, or anything else you can identify from the video data.

For test periods, you are provided with 15 minutes input data, followed by a two-minute embargo. This data is embargoed because in practice, the video data will be processed during this time and won’t be available instantaneously. The prediction period is 5 minutes after the 2 minute embargo period.

Find the re-encoded video data in these Google drive folders:

The data is also available on Google Cloud:

brb-traffic contains the smaller re-encoded dataset: https://storage.googleapis.com/brb-traffic/

brb-traffic-full contains the full >500GB dataset: https://storage.googleapis.com/brb-traffic-full/

Files
Description
Files
Train contains the target. This is the dataset that you will use to train your model.
Test resembles Train.csv but without the target-related columns. This is the dataset on which you will apply your model to.
Notebook to help you view the videos from the Google Cloud Bucket.
Is an example of what your submission file should look like. The order of the rows does not matter, but the names of the "ID" must be correct.
Download the dataset using a torrent client