You have been appointed Lead Data Analyst and asked to build a predictive model that determines whether a building will have an insurance claim during a given period. You will have to predict the probability of at least one claim over the insured period of the building. The model will be based on the building characteristics. The target variable, Claim, is:
- 1 if the building has at least one claim over the insured period.
- 0 if the building doesn’t have a claim over the insured period.
The files for download are:
- train_data.csv - the dataset that you will use to train your model.
- test_data.csv - the dataset to which you will apply your model.
- sample_submission.csv - an example of what your submission file should look like. The order of the rows does not matter, but the "Customer Id" values must be correct.
- VariableDefinitions.csv - definitions of the variables in the train and test datasets.
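
As a rough starting point, the workflow implied by these files (train on train_data.csv, score test_data.csv, write a file shaped like sample_submission.csv) could be sketched as below. This is only a baseline under stated assumptions, not a prescribed method: it assumes pandas and scikit-learn are available, that both data files contain a "Customer Id" column, that the submission file has the columns "Customer Id" and "Claim", and that every other column is a building characteristic usable as a feature. The function name `predict_claim_probability` and the output file name `submission.csv` are hypothetical, and logistic regression is just one convenient choice of classifier.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

ID_COL = "Customer Id"  # identifier column named in the brief
TARGET = "Claim"        # target variable named in the brief


def predict_claim_probability(train: pd.DataFrame, test: pd.DataFrame) -> pd.DataFrame:
    """Fit a baseline classifier on `train` and return P(Claim = 1) for `test`."""
    # Assumption: every column other than the id and the target is a feature.
    features = [c for c in train.columns if c not in (ID_COL, TARGET)]
    X_train, y_train = train[features], train[TARGET]

    numeric = X_train.select_dtypes(include="number").columns.tolist()
    categorical = [c for c in features if c not in numeric]

    # Impute missing values, scale numeric columns, one-hot encode the rest.
    preprocess = ColumnTransformer([
        ("num", Pipeline([
            ("impute", SimpleImputer(strategy="median")),
            ("scale", StandardScaler()),
        ]), numeric),
        ("cat", Pipeline([
            ("impute", SimpleImputer(strategy="most_frequent")),
            ("onehot", OneHotEncoder(handle_unknown="ignore")),
        ]), categorical),
    ])

    model = Pipeline([
        ("prep", preprocess),
        ("clf", LogisticRegression(max_iter=1000)),
    ])
    model.fit(X_train, y_train)

    # Column 1 of predict_proba is the probability of the positive class (Claim = 1).
    proba = model.predict_proba(test[features])[:, 1]
    return pd.DataFrame({ID_COL: test[ID_COL].to_numpy(), TARGET: proba})


# Usage, assuming the CSV files sit in the working directory
# ("submission.csv" is a hypothetical output name):
# train = pd.read_csv("train_data.csv")
# test = pd.read_csv("test_data.csv")
# predict_claim_probability(train, test).to_csv("submission.csv", index=False)
```

The output is a probability rather than a hard 0/1 label because the task asks for the probability of at least one claim. Before submitting, it is worth comparing the column names against sample_submission.csv, since the "Claim" column name in the output is assumed here.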