Melio MLOps Machine Translation Challenge 🇫🇷

$6 500 USD

Completed (almost 2 years ago)

Skills you will learn

MLOps

Natural Language Processing

Machine Translation

448 joined

42 active

Start

Mar 03, 24

Enrolments close

Aug 19, 24

Sep 01, 24

Reveal

Sep 17, 24

About

This is a parallel corpus for machine translation from French to Dyula. It contains roughly 10,929 French-Dyula sentence pairs. The corpus extends the Common Voice version 4 French corpus (https://arxiv.org/abs/1912.06670) to the Dyula language. The French sentences were pre-processed, manually translated into Dyula by a linguist, and then aligned manually.

This dataset collection effort was supported by the International Development Research Centre (IDRC) and the Swedish International Development Cooperation Agency (SIDA), and managed by the African Center for Technology Studies (ACTS) in collaboration with the Université Virtuelle de Côte d'Ivoire (UVCI) and Data354, through the Artificial Intelligence for Development (AI4D) Africa programme.

You can access the dataset here: https://huggingface.co/datasets/uvci/Koumankan_mt_dyu_fr
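
As a rough illustration of how the corpus could be pulled into a training script, here is a minimal sketch in Python. It assumes the third-party Hugging Face `datasets` package is installed, that a split named "train" exists, and that each record stores its texts under a "translation" field keyed by "fr" and "dyu". None of these details are stated on this page, so check them against the dataset card. The helper names `to_pairs` and `load_pairs` are hypothetical.

```python
from typing import Dict, Iterable, Iterator, List, Tuple


def to_pairs(
    records: Iterable[Dict],
    src: str = "fr",
    tgt: str = "dyu",
) -> Iterator[Tuple[str, str]]:
    """Yield (source, target) sentence pairs, skipping empty sides.

    Assumes each record looks like {"translation": {"fr": ..., "dyu": ...}};
    this layout is an assumption, not something this page confirms.
    """
    for record in records:
        translation = record["translation"]
        source = translation[src].strip()
        target = translation[tgt].strip()
        if source and target:
            yield source, target


def load_pairs(split: str = "train") -> List[Tuple[str, str]]:
    """Download one split from the Hugging Face Hub and return French-Dyula pairs.

    The split name is an assumption; the import is local so that to_pairs
    can be used without the `datasets` package installed.
    """
    from datasets import load_dataset

    dataset = load_dataset("uvci/Koumankan_mt_dyu_fr", split=split)
    return list(to_pairs(dataset))
```

Calling `load_pairs()` needs network access. `to_pairs` can be tried offline on any list of dictionaries with the assumed layout.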

The dataset is available on the Highwind platform. Please see this documentation: https://docs.highwind.ai/zindi/experiment/

Files

Here are answers to FAQs.

Slides from the webinar

Here is additional information on how the evaluation is calculated.
