iX Mobile Banking Prediction Challenge
Scholarship worth $3,495 USD
The prize is one scholarship to a 6-week data science remote program called ‘iX Remote’ from 6 July to 14 August 2021, valued at $3,495 USD.
242 data scientists enrolled, 103 on the leaderboard
Financial ServicesPredictionStructured
17 May—13 June
28 days
Should we use country_code and region as categorical features or numerical ones?
published 9 Jun 2021, 03:54

In reality, country_code and region are categorical features. But in our dataset, they are encoded as numerical ones. So, in which encoding format should we input these features for helping the model to better generalize?

They are categorical no matter the format they come in because each country has a unique number.

We all understand that they are categorical features. But passing them to the model as numerical features, the model will try to compute weight like someone belonging to a country with a higher country_code/region will have more/less chances to use mobile/internet banking. Anyway, thanks for sharing ...