Primary competition visual

Your Voice, Your Device, Your Language Challenge

Helping Africa
1 000 CHF
Challenge completed ~1 month ago
Automatic Speech Recognition
Natural Language Processing
278 joined
73 active
Starti
Jul 22, 25
Closei
Sep 22, 25
Reveali
Sep 22, 25
User avatar
Koleshjr
Multimedia university of kenya
Zindi ML Live: Kiswahili ASR - From CTC Theory to Real Improvements!
Platform · 31 Jul 2025, 19:41 · 4

Day 2 of our Kiswahili ASR challenge stream focused on Conformer + CTC-based models and wow, the progress we made was huge!

✅ What we did:

  • Explained CTC models and how they work in the context of speech recognition
  • Shared easy-to-understand resources for learning about CTC and Conformer architectures
  • Wrote custom inference code to run models fine-tuned on Swahili
  • Improved our score from 0.55 to 0.44 on the leaderboard 🔥

💡 But here’s the best part one of the stream viewers, @Joseph_gitau, coded live along with the stream and even beat my score! That moment really highlighted the value of urgency and real-time collaboration.

If you're thinking of getting involved, join the streams live! The learning, engagement, and results hit differently when you're there in real time. That said, the YouTube replays are still there for anyone catching up:

📹 Replay: Zindi ML Live: Kiswahili ASR - From CTC Theory to Real Improvements (Part 2)

📺 Channel: https://www.youtube.com/@koleshjr

📆 Next Streams: https://www.twitch.tv/koleshjr/schedule

Let’s keep pushing forward, see you in the next one!

Discussion 4 answers
User avatar
msamwelmollel
University of Glasgow

Nice tutorial! From the company repo, they present the baseline values.

https://github.com/Sartify/Swahili-Challenge-Competition---Pan-African-Wide-Alignment-PAWA-ASR

1 Aug 2025, 14:17
Upvotes 0
User avatar
Koleshjr
Multimedia university of kenya

Wow 0.13? That is very impressive and we are currently very far from that baseline but we will try our best to beat it! But I assume that is after you used the pawa llm for corrections? what was the low WER from the STT itself?

User avatar
msamwelmollel
University of Glasgow

This is 0.13 without Pawa.

User avatar
Koleshjr
Multimedia university of kenya

greatttt