🩺 Hot Topic: misalignment between the evalu...

Kenya Clinical Reasoning Challenge

Helping Kenya

$10 000 USD

Completed (~1 year ago)

Skills you will learn

Prediction

Natural Language Processing

SLM

1672 joined

439 active

Info Data Chat Leaderboard

Start

Apr 03, 25

Jun 29, 25

Reveal

Jun 30, 25

jmsmuigai

misalignment between the evaluation metric (ROUGE)

Data · 30 Jun 2025, 11:39 · 0

The primary flaw was misalignment between the evaluation metric (ROUGE) and the task objective (clinical reasoning). ROUGE rewarded superficial overlaps, not semantic or clinical depth, leading to model and strategy choices that undermined the challenge’s intended purpose.

Discussion 0 answers

Join the largest network for
data scientists and AI builders

About FAQs

Status