Primary competition visual

International Women's Day Challenge

Helping the World
3 000 Points
Challenge completed 8 months ago
Prediction
266 joined
101 active
Starti
Mar 07, 25
Closei
Mar 30, 25
Reveali
Mar 31, 25
User avatar
Jaw22
Zindi africa
Ethics vs Bias
Data · 13 Mar 2025, 06:01 · 2

Dear All,

Upon analyzing the variables 'latitude' and 'longitude,' it appears that data points for two provinces—Eastern Cape (EC) and Western Cape (WC)—are entirely absent from the dataset. This exclusion raises several questions and concerns in my mind.

Firstly, I wonder why these two provinces were omitted. By excluding EC and WC, are we inadvertently creating an incomplete population in the dataset? Could this exclusion represent what is referred to as a "hidden dataset," which the model may ultimately need in order to generalize effectively?

Would it not be more appropriate to compile a hidden dataset by randomly sampling data from all provinces, rather than isolating EC and WC entirely? This approach might ensure a more representative sample while mitigating any potential data imbalance.

Additionally, could the high incidence of the 'target variable' in EC and WC have influenced this decision? If the target's prevalence is significantly higher in these regions, it may have been perceived as a potential source of bias in the dataset. That said, excluding these provinces outright could inadvertently impact the model's accuracy and fairness.

Finally, an ethical question arises: is it justifiable to exclude vulnerable populations, such as women in EC and WC, from the dataset? Ensuring that these regions are represented might provide insights that are essential to addressing their unique needs and challenges. Transparency regarding this exclusion, as well as an exploration of alternative approaches, would go a long way in fostering trust and equitable and ethical analysis.

I welcome your thoughts and perspectives on this matter. Please feel free to share your insights. These are simply my musings, and I mean no harm or offense to any stakeholders involved.

Kind regards,

Augustine aka Jaw22

Discussion 2 answers
User avatar
Jaw22
Zindi africa

Click on this link to view the map on my github profile:

International_Women-s_Day_Challenge_Zinzi/Screenshot_13-3-2025_91754_colab.research.google.com.jpeg at main · Jaw22/International_Women-s_Day_Challenge_Zinzi

You will notice it is very densely distributed in the Gauteng, KZN, Limpopo, FreeState and Mpumalanga provinces. Whereas, it is not so dense distributed in Northern Cape.

Western Cape and Eastern Cape is blank.

13 Mar 2025, 07:43
Upvotes 1
User avatar
CodeJoe

Good observation, never came in mind to check that.

14 Mar 2025, 19:02
Upvotes 0