🐼 AI in Focus: Yet another proof of leaderboa...

Turtle Recall: Conservation Challenge

Helping Kenya

$10 000 USD

Completed (~4 years ago)

Skills you will learn

Classification

Computer Vision

756 joined

246 active

Info Data Chat Leaderboard

Start

Nov 19, 21

Apr 21, 22

Reveal

Apr 21, 22

kostyayatsok

Yet another proof of leaderboard malfunction

Data · 7 Mar 2022, 20:12 · 1

Hi all,

There were a lot of disscusions about low public scores despite high validation, and there's my two cents about this phenomenon.

I assume that some mess happens with image_id column in test.csv. The point is that in test data there is no any correlation between provided image_location and real one. However, train data is labeled almost perfectly. You can see some examples here: https://ibb.co/album/QjWZvV. Also my code for generation: https://colab.research.google.com/drive/1T4jHIcHNvIZgDa--FpM9i0SqvaWDkrR6?usp=sharing.

Such a significant difference between quontity of wrong location labels in test and train defenetly not normal and should be fixed somehow.

Wrong match between "image_id" and labels can explain both location issue and low public scores.

Hopefully the issue will be found and resolved and we will have a great contest!

Discussion 1 answer

Fnoa

Good point!

Don't have much hope that they will solve it. They don't even respond.

8 Mar 2022, 11:16

Upvotes 0

Join the largest network for
data scientists and AI builders

About FAQs

Status