
Your Voice, Your Device, Your Language Challenge

Helping Africa
1 000 CHF
Challenge completed ~1 month ago
Automatic Speech Recognition
Natural Language Processing
278 joined
73 active
Start: Jul 22, 25
Close: Sep 22, 25
Reveal: Sep 22, 25
Koleshjr
Multimedia University of Kenya
Zindi ML Live – Kiswahili ASR (Part 3): Data Prep for CTC Finetuning
Platform · 5 Aug 2025, 20:01 · 1

Today’s stream was shorter than usual but no less important. We spent the session working through data preparation for CTC fine-tuning, which ended up being more involved than expected. 😅

Here’s what went down:

  • Converted the dataset into NeMo's required format
  • Fixed transcript and audio alignment issues
  • Set up everything needed to start fine-tuning ASR models next stream
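
For anyone who wants a feel for the first two steps, here is a minimal sketch, assuming WAV audio and a plain list of (audio path, transcript) pairs. NeMo ASR manifests are JSON Lines files with `audio_filepath`, `duration`, and `text` on each line. The helper names (`build_manifest`, `normalize_text`) and the cleanup and skip rules are illustrative assumptions, not the exact code from the stream.

```python
import json
import wave
from pathlib import Path


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds (stdlib only)."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def normalize_text(text):
    """Lowercase and collapse whitespace (hypothetical, minimal cleanup)."""
    return " ".join(text.lower().split())


def build_manifest(pairs, manifest_path):
    """Write one JSON object per line, skipping pairs that cannot be aligned.

    pairs: iterable of (audio_path, transcript) tuples.
    Returns the number of entries written.
    """
    written = 0
    with open(manifest_path, "w", encoding="utf-8") as out:
        for audio_path, transcript in pairs:
            text = normalize_text(transcript or "")
            # Assumed alignment rule: drop pairs with no audio or no text.
            if not text or not Path(audio_path).is_file():
                continue
            entry = {
                "audio_filepath": str(audio_path),
                "duration": round(wav_duration_seconds(audio_path), 3),
                "text": text,
            }
            # ensure_ascii=False keeps any non-ASCII characters readable.
            out.write(json.dumps(entry, ensure_ascii=False) + "\n")
            written += 1
    return written
```

Calling `build_manifest(pairs, "train_manifest.json")` (the file name is just an example) returns how many pairs survived, which is a quick way to spot how many transcripts or audio files were dropped. For non-WAV audio, the duration helper would need a library such as `soundfile` or `librosa` instead of the stdlib `wave` module.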

Even though there wasn’t much modeling today, this was a big foundational step, and now we’re set for real training next session!

📺 Watch the replay: https://youtu.be/IpqdBBF-kFU

📆 Next Live Stream: https://www.twitch.tv/koleshjr/schedule

📹 All Replays: https://www.youtube.com/@koleshjr

Shoutout to everyone who still showed up, the grind never stops! 💯

Discussion 1 answer
yehoshua

A little busy, but I will watch it tomorrow, or maybe on the weekend. I will update you! 😉

5 Aug 2025, 21:34