Hi, I would like to report many inconsistencies in the dataset labeling. Sometimes fruits are clearly visible but not labeled, and sometimes they are; sometimes hidden fruits are labeled and sometimes not. This labeling inconsistency is a problem for the detection model... Can we tag these missing labels ourselves (only in Train.csv)? Are these inconsistencies also present in the test set?
Thanks
Hi alenic,
Yes, there are inconsistencies, but this is real-life data. Think of different ways to address them, and consider that these inconsistencies might carry over to the test set.
Thank you for the reply. In my opinion, handling a noisy training set is not a problem, but noise in the test set makes it hard to judge which algorithm is actually best: to improve the score you have to "learn" how to model the labeling process itself (assuming the test distribution is the same), in other words, learn the test bias, which is not ideal for a production solution IMO. But OK, thanks for the information :)
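For what it's worth, one cheap way to make training less sensitive to inconsistent annotations (not something from this thread, just a common trick) is label smoothing on the classification targets, so the model is never pushed to be fully confident about a possibly wrong label. A minimal numpy sketch, assuming one-hot class targets:

```python
import numpy as np

def smooth_labels(one_hot, eps=0.1):
    """Soften one-hot targets: the true class gets 1 - eps plus its share
    of the redistributed mass, every class gets eps / n_classes."""
    n_classes = one_hot.shape[-1]
    return one_hot * (1.0 - eps) + eps / n_classes

# e.g. smooth_labels(np.array([1.0, 0.0, 0.0]), eps=0.3) gives [0.8, 0.1, 0.1]
```

In a real detection pipeline you would apply this (or the equivalent built-in option of your loss function) to the class targets of each box rather than re-labeling Train.csv by hand.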
That's the whole challenge. Otherwise, wouldn't @amyflorida626 have used their own baseline with a few tweaks?