But seriously, I have been looking at the dataset for a while now and I'm still very confused. Where is the target value (num_of_tickets)? Or are we supposed to generate that ourselves and then use it for training?
Hi. The number of tickets is determined by the number of people on a specific bus at a specific time, which you can infer from the given data. If you can't make that bit up... 🤷🏿♂️
I think your question might deserve more credit than the response by @dkaila. If everyone "makes that bit up" - trivially or otherwise - how are they going to declare a winner? I'm assuming they will validate the results on the test set in some way?

Different people are bound to make different adjustments to the data in order to clean it. Without a given target, your whole data-to-information pipeline runs backward if you validate answers against an assumed target (because your target is, in this case, a function of the data).

Now, I haven't attempted this challenge myself, but it seems to me that if 5 people calculated the response differently, they could all train to high accuracy yet receive ambiguous scores on the test results - all because the response itself is "trivial" to derive.

I understand this should be trivial. I'm just remarking that the answer to this question may not be so simple as to provoke a retort like `If you can't make that bit up... 🤷🏿♂️`.
The data doesn't explicitly have a target value; you create it. Use the groupby function on the ride_id column and aggregate by count, and you will get the number of tickets that were sold per ride. Or just follow the notebook that was shared by one of us: https://github.com/pawelmorawiecki/traffic_jam_Nairobi/blob/master/RandomForest.ipynb
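For example, here is a minimal pandas sketch of that idea (the file name `train_revised.csv` and the target column name `number_of_tickets` are my assumptions; adjust them to your copy of the data):

```python
import pandas as pd

# Load the raw training data (one row per ticket sold; file name is an assumption)
train = pd.read_csv("train_revised.csv")

# Counting rows per ride_id gives the target: tickets sold per ride
tickets = (
    train.groupby("ride_id")
         .size()
         .reset_index(name="number_of_tickets")
)

# Attach the target back onto a single row per ride
rides = train.drop_duplicates(subset="ride_id").merge(tickets, on="ride_id")
```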
Hello Stefan, you won't have different targets, since we are all using the count of ride_ids.
Thanks for this helpful information.
If you're interested in R, you can use dplyr:

```r
library(dplyr)

# `data` is the raw training set, e.g. data <- read.csv("train_revised.csv")
# (adjust the file name to your copy)
Tickets_data <- data %>%
  group_by(ride_id) %>%
  summarise(Total = n())
Tickets_data <- arrange(Tickets_data, ride_id)

# Merge the ticket counts back onto the original data frame
merged_data <- merge(data, Tickets_data, by = "ride_id")
merged_data <- merged_data[, -c(2, 3, 4)]

# Finally, make it unique so that you don't have duplicate ride_ids
c_merged_data <- unique(merged_data)

# Save it as a CSV so that we can read it in Python
# (row.names = FALSE avoids a spurious index column on the Python side)
write.csv(c_merged_data, file = "train_revised2.csv", row.names = FALSE)
```
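On the Python side, reading it back is then straightforward (the file name matches the `write.csv` call above; `Total` is the per-ride ticket count created in R):

```python
import pandas as pd

# Read the CSV exported from R and inspect the target column
train = pd.read_csv("train_revised2.csv")
print(train[["ride_id", "Total"]].head())
```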