Tunisian Fraud Detection Challenge
Detect tax fraud using the Ministry of Finance of Tunisia's data
Prize: Knowledge
Time: Active
Participants: 207 active · 1124 enrolled
Helping: Tunisia
Tags: Good for beginners, Classification, Government
About

The dataset, provided by the Tunisian Ministry of Finance, includes variables on tax analysis, taxpayer inspection, and VAT returns. The training set provided here is a subset of more than 25,000 samples aggregated by year. The data is anonymized and contains a large number of numeric variables. The "TARGET" column is the variable to predict: the amount of tax liability that the taxpayer has to adjust.
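As a rough illustration of how the "TARGET" column can be separated from the anonymized numeric variables, here is a minimal sketch using only the Python standard library. The two-row sample, the feature names `feature_a` and `feature_b`, and the presence of an `ID` column in the training file are assumptions made for the example; the real column names are not listed in this description.

```python
import csv
import io

# Hypothetical two-row stand-in for the training file. The real columns are
# anonymized, so these names and values are illustrative only.
SAMPLE_TRAIN = """ID,feature_a,feature_b,TARGET
train_0,1.5,200,0.0
train_1,0.2,35,1250.75
"""


def split_features_target(csv_text, target_col="TARGET", id_col="ID"):
    """Parse CSV text and return (ids, feature_rows, targets)."""
    ids, features, targets = [], [], []
    for row in csv.DictReader(io.StringIO(csv_text)):
        ids.append(row.pop(id_col))                  # assumed identifier column
        targets.append(float(row.pop(target_col)))   # value to predict
        # Everything that remains is treated as a numeric feature.
        features.append({name: float(value) for name, value in row.items()})
    return ids, features, targets


ids, X, y = split_features_target(SAMPLE_TRAIN)
print(ids, y)
```

With the real training file, the same function could be applied to the file's text, provided the column names match the assumptions above.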


Files
Train.csv contains the target. This is the dataset you will use to train your model.
The sample submission file shows the submission format for this competition, with the ‘ID’ column mirroring that of Test.csv and the ‘target’ column containing your predictions. The order of the rows does not matter, but the ID values must be correct.
Test.csv resembles Train.csv but without the target-related columns. This is the dataset to which you will apply your model.
The variable definitions file gives the full list of variables and their explanations.
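
The submission format described above can be sketched as follows, using only the Python standard library. The helper names `build_submission` and `ids_match`, the example IDs, and the placeholder prediction values are all hypothetical; only the ‘ID’ and ‘target’ column names come from the description. The check reflects the stated rule that row order does not matter while the IDs must be correct.

```python
import csv
import io


def build_submission(test_ids, predictions):
    """Return CSV text with an 'ID' column and a 'target' column."""
    if len(test_ids) != len(predictions):
        raise ValueError("need exactly one prediction per test ID")
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["ID", "target"])
    for row_id, pred in zip(test_ids, predictions):
        writer.writerow([row_id, pred])
    return buffer.getvalue()


def ids_match(submission_text, test_ids):
    """True if the submission holds exactly the test IDs, in any order."""
    rows = csv.DictReader(io.StringIO(submission_text))
    submitted = [row["ID"] for row in rows]
    return sorted(submitted) == sorted(test_ids)


test_ids = ["test_0", "test_1", "test_2"]  # hypothetical IDs
predictions = [0.0, 310.5, 42.0]           # placeholder values, not model output
submission = build_submission(test_ids, predictions)
print(submission)
```

Running a check like `ids_match` before uploading is one way to catch a missing or misspelled ID early.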