You are provided with video data from four cameras, with unique views of the four entrances and exits of the Norman Niles traffic roundabout. Each ~1 minute time segment is at the same period for all four cameras, and is labeled with a congestion classification [“free flowing”, “light delay”, “moderate delay”, “heavy delay”].
Your task is to develop models that extract and engineer features from the raw video data to predict the congestion level. These features should capture underlying traffic dynamics such as flow rate, number of vehicles, vehicle entry and exit timing, or other movement patterns that correlate with congestion. You are free to use any reproducible automated modeling or labeling techniques (excluding manual labelling). The test set contains only videos (no labels), and your model must infer congestion classes based solely on learned representations from the training phase.
You may augment or generate your own training data to increase the number of training samples. Ensure that your train creation process is reproducible and included in your submitted code.
Back propagation is NOT allowed during inference. Zindi is committed to providing solutions of value to the client. The implementation of this solution will be to ingest 15timestamps of video data and to predict the 18th to 23rd congestion_enter_rating and congestion_exit_rating.
This is the structure: Training data → Test input → 2-minute embargo (operational lag) → 5-minute test output.
Your solution should operate in real time, meaning:
Back-propagation within a training loop (i.e. updating model weights during normal training) is, of course, allowed — just keep the real-time deployment context in mind.
This means you should not use future data to predict the pass as this would not be possible in the real world.
Anecdotal evidence indicates that in Barbados, many drivers do not use turn signals when entering or exiting roundabouts — a key behavioral factor that may increase congestion. The Ministry hopes that insights from this challenge will help design targeted interventions to improve traffic flow across the island.
The top 20 solutions on the private leaderboard must provide documentation on the top factors contributing to congestion ratings. After the challenge closes, we will request a document from the top 20 with a table that includes feature name, feature contribution, and notes.
You are welcome to use any modeling and labelling techniques, except manual labelling, to label the data. Suggestions include flow rate, number of cars entering and exiting the roundabout, use of indicator, vehicle type, vehicle occupancy, or anything else you can identify from the video data.
For test periods, you are provided with 15 minutes input data, followed by a two-minute embargo. This data is embargoed because in practice, the video data will be processed during this time and won’t be available instantaneously. The prediction period is 5 minutes after the 2 minute embargo period.
Find the re-encoded video data in these Google drive folders:
📷
📷
📷
📷
📷The data is also available on Google Cloud:
brb-traffic contains the smaller re-encoded dataset: https://storage.googleapis.com/brb-traffic/
brb-traffic-full contains the full >500GB dataset: https://storage.googleapis.com/brb-traffic-full/
Join the largest network for
data scientists and AI builders