Flight Delay Prediction Challenge
Predict airline delays for Tunisian aviation company, Tunisair
628 data scientists enrolled, 213 on the leaderboard
20 September 2019
Definition of variables
published 16 Nov 2019, 09:17

The definition of variables does not match the data in the train and test data. Variables like ATA(actual time of arrival), ATD(actual time of departure) etc do not appear in the train and test data

Yes, it was originally going to be a multi-class classification problem and they defined the variables in that way, but then they turned it into a regression problem in the last minute. So some variables that are defined here won't match the variables in the data. But for your mentionned variables, you can calculate that by adding the delay to the estimated time of arrival to get the actual time of arrival, and same for departure.

Thanks but which one is the delay?

should be a variable called 'TARGET', and the last one ,in order , in the train file.

Is the 'TARGET' in seconds or minutes?