Primary competition visual

EY Biodiversity Challenge

$3 500 USD
25 days left
Classification
Feature Engineering
Geospatial Data
Geospatial Analysis
989 joined
353 active
Starti
Mar 27, 26
Closei
May 24, 26
Reveali
May 24, 26
User avatar
Koleshjr
Multimedia university of kenya
⚠️ Clarification You Might Have Missed: Derived Spatial Features Are ALSO Banned
29 Apr 2026, 11:21 · 1

Just re-emphasizing an important clarification from the organizers that some people might have missed 👇

There seems to be some confusion around what counts as “using latitude/longitude” in this challenge.

The rule is not just about excluding the raw lat/lon columns — it also applies to any features derived from them.

👉 Not allowed:

  • Distance between points
  • Nearest neighbors / spatial proximity
  • Clustering based on coordinates
  • Grid cells / spatial bins
  • Any engineered feature that encodes geographic position

Even if you don’t explicitly include lat/lon in your model, these features still reintroduce spatial information, which goes against the intent of the competition.

👉 Allowed:

  • Using lat/lon strictly to extract external data like TerraClimate variables

👉 Not allowed:

  • Using lat/lon (directly or indirectly) as model features

A simple rule of thumb:

Using lat/lon to extract TerraClimate values = ✅ Allowed Using lat/lon to create model features (including distances) = ❌ Not allowed

The goal here is to ensure models rely on the provided environmental/climate data, not on geographic shortcuts.

Let’s keep things fair and aligned with the spirit of the competition 👍

Discussion that clarified this:

🌍 Challenge Chat: Action Required - Data Rules ... - 666 Views

Discussion 1 answer
User avatar
MICADEE
LAHASCOM

@Koleshjr Well said 👍. I support this in totality. In fact I needed to reimplement both my extraction and ML pipelines all over again when I saw the same things well pointed out by @meganomaly. Have sent both my old pipeline and newly updated pipeline to Zindi for proper review. So after uploading my new sub file from this new pipeline, my LB score went back to 0.9450 with CV score 0.9412 and another pipeline with LB score 0.9439 and CV score 0.9434 respectively. I hope and wish every participant will embrace this best standard practices and stick to the rules.

29 Apr 2026, 15:50
Upvotes 0