Dear competitors,
Thank you for your patience while we worked on the error metric. It proved harder than we initially expected because of the diacritics and special characters involved.
We have implemented the ROUGE score, reporting the F-measure. The metric went live on 5 May 2021 and the leaderboard was rescored.
The Recall-Oriented Understudy for Gisting Evaluation (ROUGE) scoring algorithm calculates the similarity between a candidate document and a collection of reference documents. Use the ROUGE score to evaluate the quality of document translation and summarization models [ref].
Once again, thank you for your patience and perseverance during this challenge.
Hi, Thanks for the update :)
But which ROUGE score is used: ROUGE-L or ROUGE-1?
Hi, it is ROUGE-N (N-gram) scoring with N = 1 (ROUGE-1), reporting the F-measure.
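For anyone who wants to see how this works, here is a minimal sketch of a ROUGE-1 F-measure in plain Python. This is not the organisers' exact implementation: it assumes simple whitespace tokenisation and lowercasing, while the official metric may normalise text (punctuation, diacritics) differently.

```python
from collections import Counter

def rouge1_fmeasure(candidate: str, reference: str) -> float:
    """ROUGE-1: unigram overlap between candidate and reference,
    reported as the F-measure (harmonic mean of precision and recall).
    Assumes whitespace tokenisation and lowercasing."""
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()
    if not cand_tokens or not ref_tokens:
        return 0.0
    cand_counts = Counter(cand_tokens)
    ref_counts = Counter(ref_tokens)
    # Clipped overlap: each unigram counts at most as many times
    # as it appears in the reference.
    overlap = sum(min(n, ref_counts[tok]) for tok, n in cand_counts.items())
    if overlap == 0:
        return 0.0
    precision = overlap / len(cand_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

# Example: 5 of 6 unigrams overlap in both directions, so P = R = F = 5/6.
print(rouge1_fmeasure("the cat sat on the mat", "the cat is on the mat"))
```

In practice you would loop this over every (prediction, target) pair in your submission and average the scores; ready-made packages such as `rouge-score` on PyPI do the same with more careful text normalisation.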
OK, got it!
Is it possible to publish some starter code that uses this ROUGE score for evaluation and model training? As a beginner, it's not clear to me how to use it.
Hello @Zindi, is punctuation important for the translation, and is it taken into account in the ROUGE score?
Update: I see the ROUGE score ignores punctuation when I try the Python metric. Thanks!
Yes, the diacritics and accents are taken into account.
Thank you for the clarification