Primary competition visual

Your Voice, Your Device, Your Language Challenge

Helping Africa
1 000 CHF
Challenge completed ~1 month ago
Automatic Speech Recognition
Natural Language Processing
278 joined
73 active
Starti
Jul 22, 25
Closei
Sep 22, 25
Reveali
Sep 22, 25
User avatar
Koleshjr
Multimedia university of kenya
Zindi ML Live: Kiswahili ASR – Exploring Speech-to-Text, Baselines & Model Errors (Part 1)
Platform · 30 Jul 2025, 20:44 · 0

In this stream, we kicked off a brand new challenge: building a Kiswahili speech-to-text (ASR) system for real-world, low-resource environments.

🧠 Here's what we did:

  • Broke down what ASR is for beginners
  • Explored the Zindi competition and the real-world impact of offline voice tech
  • Researched past ASR competitions and stalked a winning solution (spoiler: it was mine 😅)
  • Created a training-free baseline by hunting for pre-trained Swahili models on HuggingFace
  • Let AI write the code for us to simulate a beginner workflow
  • Faced a hilarious number of bugs (oops) but got it working in the end
  • Landed a Top 5 leaderboard position with zero training 🎉

FYI: The 3rd Placed sub is not shown on the stream but it is a hunted model as well(So keep hunting haha)

Tomorrow we explore: conformer-ctc-asr baselines

📺 Watch the full replay here: https://youtu.be/rZuEj6JBZOY?si=QDg2S_5qzqD714cX

📌 Subscribe to catch future ML streams: https://www.youtube.com/@koleshjr

📆 Join live sessions (Tues/Wed/Thurs): https://www.twitch.tv/koleshjr/schedule

Discussion 0 answers