🩺 Challenge Chat: Suggestion: Reopen and Extend ...

Kenya Clinical Reasoning Challenge

Helping Kenya

$10 000 USD

Completed (~1 year ago)

Skills you will learn

Prediction

Natural Language Processing

SLM

1672 joined

439 active

Info Data Chat Leaderboard

Start

Apr 03, 25

Jun 29, 25

Reveal

Jun 30, 25

khushimalik19

Suggestion: Reopen and Extend the Competition with Fair Evaluation

30 Jun 2025, 12:59 · 0

The current reliance on the ROUGE score is fundamentally misaligned with the competition’s goals. ROUGE rewards surface-level word overlap, not clinical correctness, reasoning quality, or safety ,all of which are vital in healthcare applications. This misguides participants to optimize for phrasing tricks rather than real, explainable medical logic.

As a participant, I experienced this first-hand: a stronger submission was mistakenly not uploaded in time due to a file mix-up. Since the leaderboard is driven by a flawed metric, this error now unfairly penalizes efforts that genuinely focused on safe, structured clinical reasoning.

I urge the organizers to consider reopening and extending the competition, and to adopt a more appropriate evaluation method, one that aligns with clinical standards and captures real-world utility. This would not only ensure fair judgment for all contributors but also serve the end-users with more trustworthy, high-impact solutions.

Warm regards, Khushi Malik

Discussion 0 answers

Join the largest network for
data scientists and AI builders

About FAQs

Status