What is the relationship between the Uber data and the mobi ticket data? I can't seem to find a merge key between the two data sets. Is it just me?
There are problems because 'date' is not unique in the mobi data, i.e. there can be more than one journey on a given day, and not every day has a journey. The Uber data is consecutive if you search on the full range of dates. I set up a loop so that, for each row in my mobi df, it looked at the date and then entered a time value from the corresponding date in my Uber df. It iterated through quite slowly but did the job. How much extra accuracy it adds remains to be seen, and the gaps in some of the Uber data mean quite a bit of cleaning. I am sure there are better ways, but that was my approach.
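One possibly faster alternative to the row-by-row loop is a many-to-one left merge on the date, which copes with repeated dates on the mobi side. This is only a rough sketch: the column names (`ride_id`, `date`, `travel_time`) and the toy values are made up for illustration and are not the real column names or data from either set.

```python
import pandas as pd

# Hypothetical stand-in for the mobi ticket data: several journeys can share a date.
mobi = pd.DataFrame({
    "ride_id": [1, 2, 3, 4],
    "date": pd.to_datetime(["2018-01-01", "2018-01-01", "2018-01-03", "2018-01-04"]),
})

# Hypothetical stand-in for the Uber data: one row per date, with a gap on 2018-01-04.
uber = pd.DataFrame({
    "date": pd.to_datetime(["2018-01-01", "2018-01-02", "2018-01-03"]),
    "travel_time": [1200.0, 1350.0, 1100.0],
})

# Left merge keeps every mobi row; validate raises an error if a date
# appears more than once in the Uber frame.
merged = mobi.merge(uber, on="date", how="left", validate="many_to_one")

# Dates missing from the Uber data show up as NaN and still need cleaning.
missing = merged["travel_time"].isna().sum()
print(merged)
print("rows without an Uber value:", missing)
```

Both date columns need to be the same dtype (here converted with `pd.to_datetime`) or the merge will not match. How to fill the NaN rows left by gaps in the Uber data is a separate decision that the merge does not make for you.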