Swahili News Classification
Can you create a classification algorithm to identify Swahili news articles by category?
Prize: Knowledge
Time: Active
Participants: 83 active · 500 enrolled
Helping: Tanzania
Classification
Media
About

The dataset contains 6439 rows of news from different sources in Tanzania. The articles fall into 5 news categories, ranging from national news to entertainment news.

Your goal is to accurately classify each Swahili news article into one of the five categories below:

  • Kitaifa (National)
  • Kimataifa (International)
  • Biashara (Business)
  • Michezo (Sports)
  • Burudani (Entertainment)
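
As a rough illustration of the task, here is a minimal baseline sketch: a multinomial Naive Bayes classifier with Laplace smoothing, written with the Python standard library only. It is not the competition's reference method. The three training sentences are invented toy examples, and the whitespace tokenizer is an assumption; in practice you would train on the text and labels from Train.csv and would likely use a dedicated machine learning library.

```python
import math
from collections import Counter, defaultdict

# The five target categories listed in the challenge description.
CATEGORIES = ["Kitaifa", "Kimataifa", "Biashara", "Michezo", "Burudani"]


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()


def train_naive_bayes(texts, labels):
    """Count words per category, documents per category, and the vocabulary."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    vocab = set()
    for text, label in zip(texts, labels):
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        doc_counts[label] += 1
        vocab.update(tokens)
    return word_counts, doc_counts, vocab


def predict(text, model):
    """Return the category with the highest log posterior score."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        # Log prior: share of training documents in this category.
        score = math.log(doc_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # ignore words never seen during training
            # Laplace (add-one) smoothed log likelihood of the word.
            count = word_counts[label][token]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy data, for illustration only (not taken from the dataset).
toy_texts = [
    "rais amehutubia bunge la taifa",
    "timu imeshinda mechi ya ligi",
    "benki imetangaza faida ya biashara",
]
toy_labels = ["Kitaifa", "Michezo", "Biashara"]

model = train_naive_bayes(toy_texts, toy_labels)
print(predict("timu imeshinda ligi", model))  # prints "Michezo"
```

Only categories that appear in the training labels can be predicted, so a real run needs examples of all five.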

How to use Colab on Zindi

How to mount a drive on Colab

Files

  • Full list of variables and their explanations.
  • Test.csv resembles Train.csv but without the target-related columns. This is the dataset to which you will apply your model.
  • Train.csv contains the target. This is the dataset that you will use to train your model.
  • The sample submission shows the submission format for this competition, with the ‘ID’ column mirroring that of Test.csv and the ‘target’ column containing your predictions. The order of the rows does not matter, but the ID values must be correct.
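
The submission format described above could be produced along these lines. This is a sketch under assumptions: it takes the two column names ‘ID’ and ‘target’ from the description, while the function name `write_submission`, the example IDs, and the output file name are hypothetical. Check the actual sample submission file before relying on it.

```python
import csv
import os
import tempfile

# The five category labels named in the challenge description.
CATEGORIES = {"Kitaifa", "Kimataifa", "Biashara", "Michezo", "Burudani"}


def write_submission(ids, predictions, path):
    """Write one row per test article with the columns 'ID' and 'target'."""
    if len(ids) != len(predictions):
        raise ValueError("ids and predictions must have the same length")
    unknown = set(predictions) - CATEGORIES
    if unknown:
        raise ValueError(f"unexpected categories: {sorted(unknown)}")
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["ID", "target"])  # header as described above
        writer.writerows(zip(ids, predictions))


# Hypothetical IDs and predictions; real IDs must be copied from Test.csv.
example_ids = ["ID_a", "ID_b"]
example_predictions = ["Michezo", "Biashara"]

# Written to a temporary directory so the example leaves no stray files.
out_path = os.path.join(tempfile.mkdtemp(), "submission.csv")
write_submission(example_ids, example_predictions, out_path)
```

Because row order does not matter, the key check before submitting is that every ID from Test.csv appears exactly once.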