Intron AfriSpeech-200 Automatic Speech Recognition Challenge
Can you create an automatic speech recognition (ASR) model for African accents, for use by doctors?
Prize
$5 000 USD
Time
2 months to go
Participants
14 active · 202 enrolled
Advanced
Automatic Speech Recognition
Health
Media
About

The data comes from two domains, healthcare (~60%) and general (~40%); the general domain includes news, sports, entertainment, politics, and Wikipedia.

There are 196 hours of accented English recordings; audio clips average ~11 seconds and come from 13 countries, covering 120 accents from West, South, and East Africa.

There are 57 819 recordings in train, 3 227 in dev, and 5 070 in test.
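As a quick sanity check on these figures, the split sizes can be turned into shares of the whole dataset and compared with the stated total duration. The sketch below uses only the numbers quoted in this description; the helper name split_summary is illustrative, and the ~11 second average is approximate, so the estimated hours are only expected to land near the stated 196 hours.

```python
# Split sizes and average clip length as quoted in the challenge description.
SPLITS = {"train": 57_819, "dev": 3_227, "test": 5_070}
AVG_CLIP_SECONDS = 11  # approximate average, so the hour estimate is rough


def split_summary(splits, avg_seconds):
    """Return total clip count, per-split share, and estimated total hours."""
    total = sum(splits.values())
    shares = {name: count / total for name, count in splits.items()}
    est_hours = total * avg_seconds / 3600
    return total, shares, est_hours


total, shares, est_hours = split_summary(SPLITS, AVG_CLIP_SECONDS)
print(f"total clips: {total}")
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
print(f"estimated hours at ~{AVG_CLIP_SECONDS}s per clip: {est_hours:.0f}")
```

The estimate comes out slightly above 196 hours, which is consistent with an average clip length a little under 11 seconds.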

Winning models should be submitted in their original format (PyTorch, TensorFlow, etc.) and in ONNX format for portability and to ease testing.

NOTE: The test audio files will only be uploaded on 19 May 2023, one week before the close of the challenge. Predictions on the test files will be scored on the private leaderboard, which will constitute the final leaderboard for this challenge.

NOTE: SampleSubmission.csv contains "audio_ids" for both dev and test, even though the test audio files are not available until 19 May 2023. From the launch of the challenge until 19 May 2023, submit your predictions for the dev audio_ids and submit "" for the test audio_ids. After 19 May 2023, submit your predictions for the dev audio_ids together with your actual predictions for the test audio_ids, which will be made available on that date.
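The submission rule above can be sketched as a small helper that fills in a prediction for every ID in the sample file and falls back to an empty string when no prediction exists yet. This is a minimal sketch, not the official tooling: the column names ID and transcript, the helper name build_submission, and the example IDs and transcripts are all assumptions made for illustration, so check the real SampleSubmission.csv header before using it.

```python
import csv
import io


def build_submission(sample_rows, dev_predictions, test_predictions=None,
                     id_col="ID", text_col="transcript"):
    """Fill every sample ID with a prediction, or "" if none is available.

    id_col and text_col are assumed column names, not confirmed by the source.
    """
    test_predictions = test_predictions or {}
    rows = []
    for row in sample_rows:
        audio_id = row[id_col]
        # Look in dev predictions first, then test; default to "".
        text = dev_predictions.get(audio_id, test_predictions.get(audio_id, ""))
        rows.append({id_col: audio_id, text_col: text})
    return rows


# Made-up stand-in for SampleSubmission.csv (hypothetical IDs).
sample_csv = "ID,transcript\ndev-001,\ndev-002,\ntest-001,\n"
sample_rows = list(csv.DictReader(io.StringIO(sample_csv)))

# Made-up model outputs, for illustration only.
dev_preds = {"dev-001": "patient reports chest pain",
             "dev-002": "take two tablets daily"}

# Before 19 May 2023: test IDs are submitted as "".
before = build_submission(sample_rows, dev_preds)
# After 19 May 2023: test IDs carry real predictions.
after = build_submission(sample_rows, dev_preds,
                         {"test-001": "blood pressure is normal"})

out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=["ID", "transcript"])
writer.writeheader()
writer.writerows(before)
print(out.getvalue())
```

Iterating over the sample file rather than over the predictions keeps every required ID in the output, so a missing prediction shows up as an empty string instead of a missing row.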

How to use Colab on Zindi

How to mount a drive on Colab

Files
Description
These are the audio files you will use to test your model on.
These are the audio files you will use to train your model on.
This file contains demographic information and the transcription for the audio files in afrispeech-dev.zip
This is an example of what your submission file should look like. The order of the rows does not matter, but the IDs must be correct.
This file contains demographic information about the audio files in afrispeech-dev.zip
This file contains demographic information about the audio files in afrispeech-test.zip. NOTE: afrispeech-test.zip will be uploaded on 19 May 2023.