Simple LLM fine-tuning solution with H2O LLM Studio
In line with the latest competition rules, only the provided train CSV was used for model training. The dataset contained quite a few missing values (empty tweets and locations), so all records with missing values were dropped. An 80/20 random split was used for training and validation.
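For reference, the data preparation can be reproduced with a few lines of pandas. This is a minimal sketch; the column names ("text", "location"), file names, and random seed are illustrative assumptions, not taken from the actual competition CSV.

```python
# Sketch of the data prep described above (column names are assumptions).
import pandas as pd
from sklearn.model_selection import train_test_split

df = pd.read_csv("Train.csv")

# Drop all records with missing values (empty tweets or locations).
df = df.dropna(subset=["text", "location"]).reset_index(drop=True)

# 80/20 random train-validation split.
train_df, valid_df = train_test_split(df, test_size=0.2, random_state=42)

train_df.to_csv("train_split.csv", index=False)
valid_df.to_csv("valid_split.csv", index=False)
```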
Initially I experimented with small language models: H2O Danube2-1.8B and H2O Danube3-4B. With H2O Danube3-4B it was possible to reach 0.132 on the public leaderboard.
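The actual fine-tuning was done through H2O LLM Studio, which is driven by its UI/config files rather than code. Purely as an illustration, here is a hedged sketch of running inference with a Danube-family checkpoint via Hugging Face transformers; the model ID, prompt wording, and generation settings are assumptions based on the public h2oai model cards, not the author's exact setup.

```python
# Illustrative inference with a Danube-family chat checkpoint (assumed setup).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "h2oai/h2o-danube3-4b-chat"  # hypothetical stand-in for the fine-tuned model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Hypothetical prompt; the real task prompt is not given in this write-up.
messages = [{"role": "user", "content": "Extract the location mentioned in this tweet: <tweet text>"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.decode(outputs[0][inputs.shape[1]:], skip_special_tokens=True))
```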
Fine-tuning larger models (e.g. meta-llama/Meta-Llama-3-8B) resulted in slight improvements on both my validation set and the public leaderboard, achieving a WER between 0.118 and 0.121.
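Since WER (Word Error Rate) is the metric cited above, predictions can be scored locally with the jiwer library. A minimal sketch with toy data (the actual reference and prediction lists would come from the validation split):

```python
# Scoring predictions with Word Error Rate using jiwer (toy data shown).
from jiwer import wer

references = ["new york", "lagos", "nairobi"]          # ground-truth locations
predictions = ["new york", "lagos state", "nairobi"]   # model outputs

score = wer(references, predictions)
print(f"WER: {score:.3f}")
```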
You can find all the details at: https://github.com/gaborfodor/zindi-location-recognition
Thank you!