
Mobile Money and Financial Inclusion in Tanzania Challenge

Helping Tanzania, United Republic of
$2 250 USD
Challenge completed over 6 years ago
Prediction
703 joined
162 active
Start: Mar 26, 19
Close: Jun 30, 19
Reveal: Jul 01, 19
Merging Map data
Data · 14 May 2019, 02:14 · 4

I'm having trouble merging the 9 mapping files so I can add features to the train dataset. I tried using the os module in Python but I'm running into encoding errors. Does anyone have a workaround?

Discussion 4 answers

I'm facing a similar problem

14 May 2019, 05:58
Upvotes 0

Use pandas with the "ISO-8859-1" encoding, like this:

pd.read_csv('FSDT_FinAccessMapping/3rd_ppp_for_upload_win.csv', encoding="ISO-8859-1")
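To apply the same call to all of the mapping files at once, something like the following might work. It is only a sketch: the helper name load_mapping_files is made up for illustration, and it assumes every mapping file is a CSV in one folder and decodes with the same encoding. It does not join the frames, since the right join keys depend on the files themselves.

```python
from pathlib import Path

import pandas as pd


def load_mapping_files(folder, encoding="ISO-8859-1"):
    """Read every CSV in `folder` into a dict keyed by file name (without .csv).

    Hypothetical helper: assumes all files share the same encoding.
    """
    frames = {}
    for path in sorted(Path(folder).glob("*.csv")):
        frames[path.stem] = pd.read_csv(path, encoding=encoding)
    return frames
```

Calling load_mapping_files('FSDT_FinAccessMapping') would then return one DataFrame per file, which you can inspect (columns, shapes) before deciding how to combine them with the train data.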

15 May 2019, 08:45
Upvotes 0

In general, when I run into encoding errors, it's usually because of some unusual characters in the dataset, often found in people's names and similar fields.

A quick way to get around this is to use:

pd.read_csv(fpath, encoding='latin')

I like to use 'latin' because I can't always remember all the ISO codes and would otherwise have to google around a bit, while 'latin' usually just works ;)
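A note on why it "just works": in Python, 'latin' is an alias for ISO-8859-1, which maps every possible byte to a character, so decoding never raises an error. The catch is that a file that is really UTF-8 will load without complaint but with garbled accents. A small sketch of a fallback, assuming you prefer UTF-8 when it is valid (the function name decode_with_fallback is made up for illustration):

```python
def decode_with_fallback(raw: bytes):
    """Try UTF-8 first; fall back to latin-1, which accepts any byte sequence."""
    try:
        return raw.decode("utf-8"), "utf-8"
    except UnicodeDecodeError:
        return raw.decode("latin-1"), "latin-1"


# A latin-1 encoded name is not valid UTF-8, so the fallback kicks in.
latin_bytes = "Café".encode("latin-1")
text_from_latin, used_for_latin = decode_with_fallback(latin_bytes)

# UTF-8 bytes decoded as latin-1 do not fail, but the accent is garbled.
garbled = "Café".encode("utf-8").decode("latin-1")
```

So if names look odd after reading with 'latin', it may be worth trying encoding='utf-8' on that file.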

I hope you were able to read the files. There are 3 kinds of features that will help you a lot to boost your score. You can try using KNN with the coordinates as input to predict the district and region from external data.
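To make the KNN idea concrete, here is a rough sketch using only numpy. Everything in it is an assumption for illustration: the function name knn_predict, the toy coordinates and the labels "region_A"/"region_B" are made up, and plain Euclidean distance on latitude/longitude is only an approximation of real distance. In practice a library classifier such as scikit-learn's KNeighborsClassifier would be the more usual choice.

```python
import numpy as np


def knn_predict(ref_coords, ref_labels, query_coords, k=3):
    """Label each query point by majority vote among its k nearest reference points."""
    ref_coords = np.asarray(ref_coords, dtype=float)
    query_coords = np.asarray(query_coords, dtype=float)
    ref_labels = np.asarray(ref_labels)

    # Pairwise squared Euclidean distances, shape (n_query, n_ref).
    diffs = query_coords[:, None, :] - ref_coords[None, :, :]
    dist2 = (diffs ** 2).sum(axis=2)

    # Indices of the k closest reference points for each query point.
    nearest = np.argsort(dist2, axis=1)[:, :k]

    predictions = []
    for row in nearest:
        values, counts = np.unique(ref_labels[row], return_counts=True)
        predictions.append(values[np.argmax(counts)])
    return np.array(predictions)


# Made-up (latitude, longitude) points with made-up labels, standing in for
# external data where the district or region is already known.
ref_coords = [(-6.8, 39.2), (-6.7, 39.3), (-6.9, 39.1),
              (-3.4, 36.7), (-3.3, 36.6), (-3.5, 36.8)]
ref_labels = ["region_A", "region_A", "region_A",
              "region_B", "region_B", "region_B"]

# Points whose region is unknown, e.g. rows from the train data.
predicted = knn_predict(ref_coords, ref_labels,
                        [(-6.75, 39.25), (-3.35, 36.65)], k=3)
```

The predicted labels can then be added to the train data as new categorical columns.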

18 Jun 2019, 14:56
Upvotes 0