The dataset, provided by the Tunisian Ministry of Finance, includes variables on tax analysis, taxpayer inspection, and VAT returns. The training set provided here is a subset of over 21,295 samples aggregated by year. The data is anonymized and contains a large number of numeric variables. The "TARGET" column is the variable to predict: the amount of tax liability that the taxpayer has to adjust.
Files available for download
- train.csv - the file you will use to train your model.
- test.csv - the file you will use to test your model.
- SampleSubmission.csv - an example of what your submission file should look like. The order of the rows does not matter, but the names of the IDs must be correct. The column "target" is your prediction.
- VariableDescription.csv - descriptions of the variables.
- Starter_notebook.ipynb - a starter notebook that will help you make your first submission and get on the leaderboard.
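To see how these files fit together, here is a minimal sketch of a constant baseline: it reads the training file, takes the median of the "TARGET" column, and writes that value as the "target" prediction for every row of the test file. This is not the method used in the starter notebook. The identifier column name `ID` is an assumption (check SampleSubmission.csv for the real header), and the helper names `read_rows` and `median_baseline` are hypothetical.

```python
import csv
import statistics


def read_rows(path):
    """Read a CSV file into a list of dicts keyed by column name."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def median_baseline(train_path, test_path, out_path,
                    id_col="ID",          # assumed name of the identifier column
                    target_col="TARGET",  # column to predict, as named in the description
                    pred_col="target"):   # prediction column in the submission file
    """Write a submission that predicts the median training target for every test row."""
    train = read_rows(train_path)

    # Keep only rows that actually carry a target value.
    targets = [
        float(row[target_col])
        for row in train
        if (row[target_col] or "").strip()
    ]
    baseline = statistics.median(targets)

    test = read_rows(test_path)
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow([id_col, pred_col])
        for row in test:
            # Same constant prediction for every ID; row order does not matter.
            writer.writerow([row[id_col], baseline])

    return baseline
```

Under those assumptions, calling `median_baseline("train.csv", "test.csv", "submission.csv")` would produce a file with one row per test ID, where `submission.csv` is just an illustrative output name. A constant prediction like this is only a reference point for judging whether a trained model is learning anything.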