Dear Zindians,
As some of you have noted, duplicate images appear in both the train and test sets. We acknowledge that this is not an ideal situation, so please accept our apologies to the whole Zindi community. Zindi is still a young platform, and we are learning and trying to do better with every challenge we release.
We have updated the reference file to exclude the "data leak" images. Over the next 48 hours your scores may change as we rescore your submissions against the new reference file.
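For anyone who wants to check their own data for this kind of leak, a minimal sketch of duplicate detection by hashing raw file bytes (directory and file names here are made up for illustration; Zindi has not published how the duplicates were found):

```python
import hashlib
from pathlib import Path

def file_hash(path: Path) -> str:
    """MD5 hex digest of a file's raw bytes."""
    return hashlib.md5(path.read_bytes()).hexdigest()

def find_duplicates(train_dir: str, test_dir: str) -> list:
    """Return (train_file, test_file) pairs whose contents are identical."""
    train_hashes = {}
    for p in Path(train_dir).glob("*"):
        train_hashes.setdefault(file_hash(p), p.name)
    dupes = []
    for p in Path(test_dir).glob("*"):
        h = file_hash(p)
        if h in train_hashes:
            dupes.append((train_hashes[h], p.name))
    return dupes
```

Note this only catches byte-identical files; re-encoded or resized copies would need perceptual hashing instead.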
You can still use the Sample Submission as a template; the scoring system will only score the IDs in the updated reference file.
Please note that you cannot use file metadata such as EXIF data. This is a computer vision / machine learning challenge. If you finish in the top 3 of this challenge, you will be required to submit your code for validation.
Thank you for keeping us accountable, and for helping to make Zindi better. We couldn’t do it without you.
The Zindi team
Hi,
Could you please let us know once rescoring is done?
Are new submissions evaluated without the "leak"?
Thx
Can you explain the "submit code for validation" part? What code will the winners be expected to upload? Inference code and model checkpoints?
I hope they will ask for a complete script that fully reproduces the training. Otherwise someone could just hand-label the test set and include it in their training data, no?
Yeah, it's really hard to tell in a competition with a small test set like this. Someone with expertise could hand-label the data and then submit a 0.000000-loss CSV. But if full reproducibility is required, I might have to redo everything, since I can't remember all the experiments, and even if I could, randomness would beat the crap out of me :D
Given the quality of the labels, I doubt anybody could hand-label it and get 0 loss :D
Will Zindi update the leaderboard manually, or should we just wait?
I agree with Val_An. Fully reproducible training and inference code should be provided by top teams. This is a standard requirement in challenges; it builds trust so that people will enter the competition at all. Moderators understand that there is an element of randomness across runs when performing the final evaluation.
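On the randomness point, most of it can be pinned down by fixing seeds up front. A minimal stdlib-only sketch (frameworks like NumPy or PyTorch have their own seeds, e.g. `np.random.seed` and `torch.manual_seed`, which would need pinning too; the seed value 42 is an arbitrary choice):

```python
import os
import random

def set_seed(seed: int = 42) -> None:
    """Pin the stdlib sources of randomness for a reproducible run."""
    os.environ["PYTHONHASHSEED"] = str(seed)  # hash randomization
    random.seed(seed)

# Re-seeding restarts the generator, so the same draws come out again.
set_seed()
first = [random.random() for _ in range(3)]
set_seed()
second = [random.random() for _ in range(3)]
assert first == second
```

Even with all seeds fixed, some GPU kernels are non-deterministic, so evaluators typically accept small score deviations.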
In my code I currently check each image's EXIF data for an orientation tag to ensure images are input the right way up. Is this now not allowed?
You may use EXIF data for orientation correction and regularization. However, you cannot use it as a modelling feature or for prediction.