Multilingual Health Question Answering in Low-Resource African Languages Challenge 🩺

Zindi

Compete Jobs Learn Chat Leaderboard

More

For Business Partners Meet the team Press Case studies AI4EAC

Multilingual Health Question Answering in Low-Resource African Languages Challenge by ITU

$5 000 USD

Under code review

Skills you will learn

Large Language Models

NLP

1611 joined

574 active

Info Data Leaderboard

Start

Apr 30, 26

Close

Jun 21, 26

Reveal

Jun 21, 26

About

The training dataset contains maternal, sexual and reproductive health (MSRH) question-and-answer pairs across four African languages – Akan, Amharic, Luganda and Swahili and the English language – spanning nine language-country configurations. It comprises approximately 29,815 training records and 6,686 validation records. It is suitable for sequence-to-sequence tasks such as health question answering and text generation in low-resource African languages.

The test dataset follows the same structure. It consists of 2,618 records in total. Unlike the training data, this dataset contains only the input health questions. Participants must use their trained model to generate the corresponding answers, which will be evaluated against the reference answers.

There is no other extra information to add.

Files

Description

Files

This starter noteboook loads the data, trains a model and make a submission.

Claire Babirye presentation

Hash presentation by Dr Elizabeth Oseku

This file contains the question-and-answer pairs needed to train your model.

This file contains question-and-answer pairs for model validation.

This file contains the questions which you should be answered by your trained model.

This file shows the format and structure of the submission file.

Join the largest network for
data scientists and AI builders

About FAQs

Privacy Policy Terms of Use Rules

Status