‘AIntuition’: Retrieval Augmented Generation (RAG) for Public Services and Administration Tasks

‘AIntuition’: Retrieval Augmented Generation (RAG) for Public Services and Administration Tasks by ITU

$1 500 USD

Completed (~2 years ago)

Skills you will learn

Generative AI

212 joined

21 active

Info Data Chat Leaderboard

Start

May 16, 24

May 17, 24

Reveal

May 17, 24

About

Participants are free to use any appropriate open-source dataset or to curate their own as long as this does not violate any data privacy regulations that apply.

Below are some databases and libraries that could be used to assemble custom datasets for fine tuning text embedding models and for running preparatory validation tests:

United Nations Digital Library: Documents and Publications https://digitallibrary.un.org/collection/Documents%20and%20Publications?ln=en
African Development Bank Documents: https://www.afdb.org/en/all-documents
BRICS Legal and Policy Documents: https://infobrics.org/documents/
EU Legal Documents: EUR-Lex: https://eur-lex.europa.eu/homepage.html

Any dataset used for fine-tuning embedding models or LLMs needs to be submitted along with the proposed solution and made open-source (for the reasons of transparency).

The test set will be made available on 16 May 2024 at 23:59 PM GMT. You will have 24 hours to do your inference and submit your submission. Note, only your most recent submission will be considered for evaluation.

You need to submit ONE .ZIP file that contains the following:

Solution that creates your solution
Documentation
Submission CSV, use the SampleSubmission provided.

Please ensure your .ZIP file is less than 30mb.

Files

Description

Files

Reference file for you to compare your submissions to.

Here are the slides from the webinar.

These are the files you will use during the testing phase.

Here are the questions. Fill out this dataframe and submit it to Zindi by 17 May 23:59 PM GMT.

Join the largest network for
data scientists and AI builders

About FAQs

Status