SUA Outsmarting Outbreaks Challenge

Helping Tanzania, United Republic of
$12 500 USD + AWS credits
Completed (~1 year ago)
Prediction
815 joined
395 active
Start: Dec 06, 24
Close: Jan 31, 25
Reveal: Feb 01, 25
Abdallah_Abra
Dataset Hints at Extra Columns—Are We Missing Data?
Data · 14 Jan 2025, 16:52 · 6

Hi everyone,

I came across something puzzling while going through the AWS resources GitHub repo for this challenge. There seem to be references to additional data points that aren't included in the provided dataset. For instance, I've noticed mentions of age group (three times), as well as indicators like toilet quality assessments and infrastructure quality metrics.

Am I missing something here? Do we have access to the full dataset, or have these additional indicators been omitted? Could @Zindi kindly confirm whether we have all the columns intended for this challenge?

Also, I’m curious to know if other participants have noticed this as well.

Thanks in advance for clarifying!

Discussion 6 answers
Abdallah_Abra

For additional context, here's the GitHub repo I'm referring to.

15 Jan 2025, 13:34
Upvotes 1
CodeJoe

Have you tried using it in any way?

Abdallah_Abra

I'm not sure I understand what you mean by "it". Can you elaborate?

CodeJoe

The dataset from AWS. Not the one on the competition.

Freelance

I guess this competition focuses only on disease type.

16 Jan 2025, 07:35
Upvotes 0

As far as I checked, there is no difference in the data.
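
For anyone who wants to repeat this kind of check, a minimal sketch of a column comparison is below. It only compares CSV headers using the standard library. The column names and the two in-memory samples are made up for illustration (they are not the real competition or repo files), so swap in `open(...)` calls on your own copies of the data.

```python
import csv
import io


def read_header(source):
    """Return the column names from the first row of an open CSV file."""
    return next(csv.reader(source))


def compare_columns(cols_a, cols_b):
    """Return (only_in_a, only_in_b, shared) as sorted lists of column names."""
    a, b = set(cols_a), set(cols_b)
    return sorted(a - b), sorted(b - a), sorted(a & b)


# Hypothetical samples standing in for the competition CSV and the repo CSV.
# Replace these with open("your_file.csv", newline="") on the real files.
competition_csv = io.StringIO("ID,Disease,Total\n1,Malaria,10\n")
repo_csv = io.StringIO("ID,Disease,Total,age_group\n1,Malaria,10,0-5\n")

only_comp, only_repo, shared = compare_columns(
    read_header(competition_csv), read_header(repo_csv)
)

print("Only in competition data:", only_comp)
print("Only in repo data:", only_repo)
print("Shared columns:", shared)
```

If both "only in" lists come back empty, the two files have the same set of columns, although the rows themselves could still differ.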

19 Jan 2025, 13:37
Upvotes 1