Your Voice, Your Device, Your Language Challenge

Helping Africa
1 000 CHF
Challenge completed ~1 month ago
Automatic Speech Recognition
Natural Language Processing
278 joined
73 active
Start: Jul 22, 25
Close: Sep 22, 25
Reveal: Sep 22, 25
Koleshjr
Multimedia University of Kenya
Zindi ML Live – Kiswahili ASR (Part 6): Data Prep for Wav2Vec
Platform · 12 Aug 2025, 20:08 · 0

Today’s focus was data preparation for Wav2Vec finetuning — making sure our audio and transcripts are clean, properly formatted, and ready for the model.

We went through:

  • Discussing “what’s next” for CTC finetuning — maybe training from scratch
  • Downloading datasets via Hugging Face
  • Preprocessing audio and transcripts
  • Structuring data for Wav2Vec (input_values & labels)
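
For readers who want to try the preprocessing and structuring steps above, here is a minimal sketch in plain Python. It is not the code from the stream. The helper names (`clean_transcript`, `build_vocab`, `encode_labels`, `normalize_audio`, `prepare_example`), the allowed character set, and the special tokens are all assumptions chosen for illustration. The audio field is named `input_values` because that is the key Wav2Vec2 models in Hugging Face Transformers expect. Downloading and resampling to 16 kHz are left out.

```python
import math
import re
import unicodedata

# Assumption: keep lowercase ASCII letters, apostrophes, and spaces.
# Adjust this set to the orthography of your own dataset.
DISALLOWED = re.compile(r"[^a-z' ]+")


def clean_transcript(text):
    """Lowercase, drop punctuation, and collapse whitespace."""
    text = unicodedata.normalize("NFKC", text).lower()
    text = text.replace("\u2019", "'")  # unify curly apostrophes (ng’ombe)
    text = DISALLOWED.sub(" ", text)
    return re.sub(r"\s+", " ", text).strip()


def build_vocab(transcripts):
    """Build a character-level vocabulary for CTC training."""
    chars = sorted(set("".join(transcripts)) - {" "})
    # Hypothetical special tokens: padding, unknown, and "|" as the
    # word delimiter that replaces spaces in the label sequence.
    vocab = {"[PAD]": 0, "[UNK]": 1, "|": 2}
    for ch in chars:
        vocab[ch] = len(vocab)
    return vocab


def encode_labels(text, vocab):
    """Turn a cleaned transcript into a list of integer label ids."""
    unk = vocab["[UNK]"]
    return [vocab.get("|" if ch == " " else ch, unk) for ch in text]


def normalize_audio(samples):
    """Scale a waveform to zero mean and unit variance."""
    if not samples:
        return []
    mean = sum(samples) / len(samples)
    var = sum((s - mean) ** 2 for s in samples) / len(samples)
    scale = math.sqrt(var + 1e-7)  # small epsilon avoids division by zero
    return [(s - mean) / scale for s in samples]


def prepare_example(samples, text, vocab):
    """Pair normalized audio with encoded transcript labels."""
    return {
        "input_values": normalize_audio(samples),
        "labels": encode_labels(clean_transcript(text), vocab),
    }


# Toy data standing in for a real dataset row.
raw = ["Habari, Dunia!", "Ng\u2019ombe wangu."]
cleaned = [clean_transcript(t) for t in raw]
vocab = build_vocab(cleaned)
example = prepare_example([0.0, 0.5, -0.5, 1.0], raw[0], vocab)
print(cleaned)
print(example["labels"])
```

In a real pipeline the same two fields would usually come from a library processor applied to each dataset row, but writing them out by hand makes it easier to see what the model actually receives.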

Data prep is the foundation, and today we poured a solid one.

Replay: https://youtu.be/CXYp2YoTMO4?si=xxJSNehXPy0uzqV8

Live Schedule: https://www.twitch.tv/koleshjr/schedule

📢 Subscribe: https://www.youtube.com/@koleshjr
