We’ve split the DRC into ~3800 equal areas, each centered on the location given in the lat/lon columns. The target variable, burn_area, is the percentage of an area that burned in a given month. Because of the way it’s measured, the burned areas for two successive months may overlap, so the total burned area over a time period isn’t necessarily equal to the sum of the burn_area figures for the months in that period. You are not permitted to use external data in this competition.
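To make the overlap point concrete, here is a minimal sketch using made-up numbers. It assumes a single hypothetical area divided into 100 equal pixels, with each month's burn recorded as a set of pixel ids. None of these names or values come from the competition data; they only illustrate why summing monthly burn_area figures can overstate the total burned area.

```python
# Toy illustration only: the pixel grid, month labels, and values are invented.
TOTAL_PIXELS = 100  # hypothetical number of equal pixels in one area

# Pixels flagged as burned in each month; pixels 20-29 appear in both months.
burned_by_month = {
    "month_1": set(range(0, 30)),   # 30 pixels burned
    "month_2": set(range(20, 45)),  # 25 pixels burned, 10 of them already counted
}


def burn_percent(pixels):
    """Percentage of the area covered by the given set of burned pixels."""
    return 100.0 * len(pixels) / TOTAL_PIXELS


# Monthly figures, analogous to burn_area for each month.
monthly_percent = {m: burn_percent(p) for m, p in burned_by_month.items()}

# Naive total: add the monthly figures (double-counts the overlap).
sum_of_months = sum(monthly_percent.values())

# Actual total: the union of burned pixels over the whole period.
burned_over_period = set().union(*burned_by_month.values())
total_percent = burn_percent(burned_over_period)

print(monthly_percent)  # per-month percentages
print(sum_of_months)    # sum of the monthly figures
print(total_percent)    # true burned percentage over both months
```

In this sketch the sum of the monthly figures is larger than the burned percentage over the whole period, because the overlapping pixels are counted twice. The sum is therefore only an upper bound on the total.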
You do NOT need to use GIS data to solve this challenge.
To make the data easier for all participants to access, we have provided download links. We recommend that you download the data before the challenge. The data is password protected, and we will share the password with all universities, as well as on the livestream, when the competition opens.
Folder codes will be shared on the day at 09:00 GMT in the university rep WhatsApp groups.
The additional data included in the test and train files is as follows: