Primary competition visual

AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF

Helping Senegal
$2 000 USD
Challenge completed over 4 years ago
Classification
Automatic Speech Recognition
Natural Language Processing
364 joined
47 active
Starti
Feb 12, 21
Closei
May 23, 21
Reveali
May 23, 21
12th place solution
Notebooks · 27 May 2021, 17:37 · 1

Hello everyone, this is the summary of the 12th solution:

Fine-tune facebook wav2vec2 without any data augmentation or parameter tuning, got 12% WER.

Using two effects (reduce speed + reverberation) to transform data with p=0.5, reduce both the attention_dropout and the hidden_dropout to 0.05, got 0.099 WER.

Other effects such as adding noise, gain, and pitch shift did not improve the results.

Code: https://github.com/anashas/Automatic-Speech-Recognition-in-WOLOF

Discussion 1 answer
User avatar
Lone_Wolf
University of ghana

Great work Sir

27 May 2021, 21:59
Upvotes 0