Primary competition visual

UmojaHack South Africa: Yassir ETA Prediction Challenge by UmojaHack Africa

Helping South Africa
R15 000 ZAR
Challenge completed over 5 years ago
Prediction
161 joined
119 active
Starti
Jul 25, 20
Closei
Jul 25, 20
Reveali
Jul 25, 20
About

The data contains details for 119,549 trips (train and test are split by date). Each row contains a start location and end location (reported as latitude and longitude to within approximately 100m) and the travel distance along the fastest route. Each trip also has a timestamp, which can be used to pull the weather for that day from Weather.csv file. The weather data includes temperature, rainfall and wind speed for the time period during which the trip data was collected.

A starter notebook will be provided. This notebook will help you make your first submission onto the leaderboard. Download the data and run the notebook on your local machine and you will be able to make a submission on the leaderboard.

Files available for download:

  • Train.csv - contains the input variables and the target, ‘ETA’
  • Test.csv - matches Train.csv but without the ETA
  • SampleSubmission.csv - is an example of what your submission file should look like. The order of the rows does not matter, but the names of the IDs must be correct.
  • Weather.csv - contains daily weather summaries, based on data from the ERA5 dataset.
  • Sample_workbook.ipynb - this Python starter notebook will help you make your first submission onto the leaderboard. Download the data and run this notebook on your local machine and you will be able to make a submission on the leaderboard.
  • StarterNotebook.ipynb - this Python starter notebook will help you improve your model. Download the data and run this notebook on your local machine and you will be able to make a submission on the leaderboard.
  • StarterNotebook.R - this R starter notebook will help you make your first submission onto the leaderboard. Download the data and run this notebook on your local machine and you will be able to make a submission on the leaderboard.

Variable Descriptions:

  • 'ID' - A unique ID
  • 'Timestamp' - Time that the trip was started
  • 'Origin_lat', 'Origin_lon' - Origin (in degrees latitude and longitude)
  • 'Destination_lat', 'Destination_lon' - Destination
  • 'Trip_distance' - Distance in meters on a driving route
  • 'ETA' - Estimated trip time in seconds
Files
Description
Files