Primary competition visual

Barbados Traffic Analysis Challenge

Helping Barbados
$11 000 USD
Completed (~1 month ago)
Computer Vision
Prediction
Object Tracking
Video Analytics
Deep Learning
1824 joined
222 active
Starti
Nov 07, 25
Closei
Jan 25, 26
Reveali
Jan 26, 26
User avatar
Amy_Bray
Zindi
đźš§ Challenge Update!
Platform · 10 Nov 2025, 15:07 · 7

We hope you’re managing to work smoothly with the video data. We’ve made a few important updates to help you get the most out of this challenge:

Prediction window updated

This challenge is tough — we’ve adjusted the prediction window to 5 minutes into the future. You will now have the following structure: Training data → Test input → 2-minute embargo (operational lag) → 5-minute test output.

Inference data clarified

The inference dataset is now explicitly labelled as test_input, so you know exactly what to use for inference.

Clarifying the “No Backpropagation” rule

As stated in the challenge design, backpropagation should not be used in training or inference in a way that uses future information to predict the past. Your solution should operate in real time, meaning:

  • Each minute must be predicted sequentially.
  • You cannot use data from minute N+1 to predict minute N.
  • You cannot use a training segment that follows a test period to inform or adjust your model during inference.

Backpropagation within a training loop (i.e. updating model weights during normal training) is, of course, allowed — just keep the real-time deployment context in mind.

This means you should not use future data to predict the pass as this would not be possible in the real world.

Data sharing and licensing update

Videos may now be uploaded to Kaggle, and the dataset license has been updated to CC BY 4.0.

These updates are designed to keep the competition fair, transparent, and fun for everyone.

If you’ve already started working, please re-download the updated files on Zindi (note the video data on the bucket has NOT changed), review the latest challenge description before resubmitting your solution.

Thank you for your patience and all the great feedback so far — and good luck!

Discussion 7 answers

I suspected this is what was meant by backpropagation being disallowed (this would be a data leakage issue). Thanks!

10 Nov 2025, 15:11
Upvotes 1
User avatar
Amy_Bray
Zindi

How are you doing with the error metric ;) It's an interesting one!

10 Nov 2025, 15:14
Upvotes 0

Wise choice for the competitions! I think in the real world ROI is the true metric so whatever works for the sponsors to maximize value from the solution!

User avatar
hark99
Self-employed

What is the name of the dataset at Kaggle

10 Nov 2025, 17:19
Upvotes 2

Training data → Test input → 2-minute embargo (operational lag) → 5-minute test output.

Does it mean that each video has a time limit of 2 minutes for reasoning?

11 Nov 2025, 03:10
Upvotes 0
User avatar
Amy_Bray
Zindi

Hi, you have training data to split and build as you see fit.

Then you have 15 segements of test input to a "forecasting" model to predict/forecast the 5 minute test output.

In a perfect, instanteous world we would have let you start forecasting/predicting from minute 16, but as always, there is inference time, video processing and storage delays that need to be taken into consideration. This is to say, that inference should take as fast as possible because 2-minutes operational lag is also for the video processing and storage access/saving delays.

So you are predicting/forecasting from minute 18-23. I hope this helps a bit.

User avatar
J0NNY

Hi Amy

What is the expectation in the real-time application of the model?

  • Will we have the congestion levels of the 15-minute segments and forecast the 18–23-minute segment? In this case, it means we can just create a forecasting model and expect the processing approaches by the client to give us the levels for the first 15 minutes. In this case, it means that it is possible to use the congestion levels given as a feature, which won't be the ideal case if the client won't be generating those levels.
  • Or do we need to use the video to generate the congestion levels of the first 15 minutes in our own model and again use the forecasting model to forecast the levels for minutes 18 to 23?