Lacuna Masakhane Parts of Speech Classification Challenge

Helping Africa
$7 000 USD
Completed (over 2 years ago)
Classification
Natural Language Processing
472 joined
101 active
Start: Jun 08, 23
Close: Sep 17, 23
Reveal: Sep 17, 23
Muhamed_Tuo
Inveniam
8th Place Solution
Notebooks · 20 Sep 2023, 22:52 · 5

I'd like to start by thanking @JEANMPIA for his contributions in the discussions. I can't recall how many times I said I was done with this contest :) I started working on it 2 weeks before the end, and was stuck at 0.45/0.47 for most of that time. Reading the discussions kept me going and ultimately led to this result.

Overview

My solution is an Afro-xlmr-large model fine-tuned on the provided training data, some additional data (Afrikaans, Arabic, French and English), and 2nd-round pseudo-labels generated on Luo & Setswana monolingual data.
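For readers new to pseudo-labelling, here is a minimal sketch of the general loop, not the actual code from this solution. A toy most-frequent-tag tagger stands in for fine-tuning Afro-xlmr-large, and the names `train_tagger`, `predict`, and `pseudo_label_rounds` are hypothetical. The default of two rounds is an assumption for illustration.

```python
from collections import Counter, defaultdict


def train_tagger(sentences):
    """Toy stand-in for fine-tuning: learn the most frequent tag per word.

    `sentences` is a list of (words, tags) pairs.
    """
    counts = defaultdict(Counter)
    for words, tags in sentences:
        for word, tag in zip(words, tags):
            counts[word.lower()][tag] += 1
    # Unknown words fall back to the most frequent tag overall.
    all_tags = Counter(tag for _, tags in sentences for tag in tags)
    fallback = all_tags.most_common(1)[0][0]
    lexicon = {word: c.most_common(1)[0][0] for word, c in counts.items()}
    return lexicon, fallback


def predict(model, words):
    """Tag a sentence with the toy model."""
    lexicon, fallback = model
    return [lexicon.get(word.lower(), fallback) for word in words]


def pseudo_label_rounds(labeled, unlabeled, rounds=2):
    """Iterative pseudo-labelling (the round count is an assumption).

    Each round labels the unlabeled sentences with the current model,
    then retrains on the gold data plus the fresh pseudo-labels.
    """
    model = train_tagger(labeled)
    for _ in range(rounds):
        pseudo = [(words, predict(model, words)) for words in unlabeled]
        model = train_tagger(labeled + pseudo)
    return model
```

In a real setup, the unlabeled sentences would be the monolingual Luo and Setswana text, and each round would retrain the transformer instead of a lookup table. One might also filter pseudo-labels by model confidence before retraining, though the post does not say whether that was done here.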

Pipeline

You can find the code here on GitHub.

Discussion 5 answers

Great solution. It really seems like refined pseudo-labels were key in this comp.

thanks for sharing

20 Sep 2023, 23:27
Upvotes 0
Muhamed_Tuo
Inveniam

Yeah, seeing the gain, I wish I could have done a few more rounds :)

After the 4th iteration, the boost is very small, but yeah, maybe a few more.. :)

Busang
Freelance

Congratulations and thank you for sharing

21 Sep 2023, 00:36
Upvotes 2
Juliuss
Freelance

Good job Tuo 👏🏿 and thank you for sharing your solution 👍

21 Sep 2023, 03:05
Upvotes 1