This is a private hackathon, open to Tanzanian citizens only. Please contact Zindi Ambassador Davis David for the secret code.
Swahili is spoken by 100-150 million people across East Africa. In Tanzania it is one of two national languages (the other is English) and it is the official language of instruction in all schools. News in Swahili is an important part of the media sphere in Tanzania.
News contributes to education, technology, and economic growth of a country, and news in local languages plays an important cultural role in many Africa countries. In the modern age, African languages in news and other spheres are at risk of being lost as English becomes the dominant language in online spaces.
The objective of this hackathon is to develop a multi-class classification model to classify news content according to their specific categories specified.The model can be used by Swahili online news platforms to automatically group news according to their categories and help readers find the specific news they want to read. In addition, the model will contribute to a body of work ensuring that Swahili is represented in apps and other online products in future.
This is a private hackathon. Please contact Zindi Ambassador for Davis David for the secret code.
Teams and collaboration
You may participate in this competition as an individual or in a team of up to four people. When creating a team, the team must have a total submission count less than or equal to the maximum allowable submissions as of the formation date. A team will be allowed the maximum number of submissions for the competition, minus the highest number of submissions among team members at team formation. Prizes are transferred only to the individual players or to the team leader.
Multiple accounts per user are not permitted, and neither is collaboration or membership across multiple teams. Individuals and their submissions originating from multiple accounts will be disqualified.
Code must not be shared privately outside of a team. Any code that is shared, must be made available to all competition participants through the platform. (i.e. on the discussion boards).
Datasets and packages
You may use only the datasets provided for this competition. Automated machine learning tools such as automl are not permitted.
You may use pretrained models as long as they are openly available to everyone.
You may work on any cloud platform such as Google Colab, AWS or similar, as long as 1) the data remains private and 2) doing so does not contravene Zindi’s rules of use.
You must notify Zindi immediately upon learning of any unauthorised transmission of or unauthorised access to the competition data, and work with Zindi to rectify any unauthorised transmission or access.
Your solution must not infringe the rights of any third party and you must be legally entitled to assign ownership of all rights of copyright in and to the winning solution code to Zindi.
Submissions and winning
You may make a maximum of 30 submissions per day. Your highest-scoring solution on the private leaderboard at the end of the competition will be the one by which you are judged.
Zindi maintains a public leaderboard and a private leaderboard for each competition. The Public Leaderboard includes approximately 50% of the test dataset. While the competition is open, the Public Leaderboard will rank the submitted solutions by the accuracy score they achieve. Upon close of the competition, the Private Leaderboard, which covers the other 50% of the test dataset, will be made public and will constitute the final ranking for the competition.
If your solution places 1st, 2nd, or 3rd on the final leaderboard, you will be required to submit your winning solution code to us to the host of the hackathon for verification. We will however encourage the winners to share their code on GitHub as a public good to the sector..
If two solutions earn identical scores on the leaderboard, the tiebreaker will be the date and time in which the submission was made (the earlier solution will win).
The winners will be paid via bank or mobile money transfer by responsible Zindi Ambassador.
You acknowledge and agree that Zindi may, without any obligation to do so, remove or disqualify an individual, team, or account if Zindi believes that such individual, team, or account is in violation of these rules. Entry into this competition constitutes your acceptance of these official competition rules.
Please refer to the FAQs and Terms of Use for additional rules that may apply to this competition. We reserve the right to update these rules at any time.
Reproducibility
Data standards:
Consequences of breaking any rules of the competition or submission guidelines:
Monitoring of submissions
The evaluation metric for this challenge is Log Loss.
The values can be between 0 and 1, inclusive.
Your submission file should look like:
test_id kitaifa michezo biashara kimataifa burudani SW1001 1 0 0 0 0 SW1005 0 0 1 0 0
To be eligible to win you must be Tanzanian.
1st Place: Tsh 100,000/=
2nd Place: Tsh 75,000/=
3rd Place: Tsh 50,000/=
Competition closes on 21 June 2020.
Final submissions must be received by 11:59 PM EAT.
We reserve the right to update the contest timeline if necessary.
Join the largest network for
data scientists and AI builders