DigiCow Farmer Training Adoption Challenge

Helping Kenya
€8 250 EUR
Under code review
Data analysis
Classification
895 joined
388 active
Start: Jan 28, 26
Close: Mar 01, 26
Reveal: Mar 02, 26
Data Dictionary
Data · 10 Feb 2026, 07:51 · 5

I am struggling to understand the explanations of some features in the data dictionary. Specifically: the description of has_topic_traine_on does not make sense (perhaps some words are missing); and for topics, are these the topics the farmer has already been trained on, or the ones they could potentially be trained on?

I would appreciate some help.

Discussion 5 answers
Juliuss
Freelance

Yes, this is a valid issue. @AJoel, could you also shed more light on the feature group_name? I see it in the datasets, but it is not mentioned in the data dictionary.

11 Feb 2026, 21:52
Upvotes 0
AJoel
Zindi

Some farmers belong to a cooperative and are organised into groups. That is what group_name refers to.

Juliuss
Freelance

Perfect👌.

Thanks so much for the feedback, @AJoel.

Juliuss
Freelance

@AJoel, these are my observations and assumptions based on your response:

Observations:

  • Most farmers do not belong to a cooperative: 89.2% in train and about 96.9% in test.
  • group_name has no nulls: every farmer has a group, even those who do not belong to a cooperative.

Assumptions:

  1. Either everyone is assigned some default group
  2. Or “non-cooperative” farmers are still grouped administratively
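
One way to probe these two assumptions is to look at how the non-cooperative farmers are spread across groups. The snippet below is only a rough sketch on toy rows, not the competition data: `belongs_to_cooperative` is a hypothetical column name (only `group_name` is confirmed in this thread), so swap in whatever the cooperative indicator is actually called.

```python
import pandas as pd

# Toy rows for illustration only; NOT the competition data.
# `belongs_to_cooperative` is a hypothetical column name.
toy = pd.DataFrame({
    "belongs_to_cooperative": [0, 0, 0, 1, 0],
    "group_name": ["G1", "G1", "G2", "G3", "G1"],
})

def summarize_groups(df, coop_col="belongs_to_cooperative", group_col="group_name"):
    """Return the non-cooperative share, the null count of the group
    column, and the group sizes among non-cooperative farmers."""
    non_coop = df[df[coop_col] == 0]
    return {
        "pct_non_coop": round(100 * len(non_coop) / len(df), 1),
        "null_groups": int(df[group_col].isna().sum()),
        "non_coop_group_counts": non_coop[group_col].value_counts().to_dict(),
    }

summary = summarize_groups(toy)
print(summary)
```

If a single group name holds nearly all non-cooperative farmers, that would lean towards a default group; if they are spread over many groups, administrative grouping seems more likely. Neither reading is confirmed by the organisers.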

In case I am wrong about these assumptions, please correct me.

AJoel
Zindi

Hello. has_topic_trained_on is a boolean flag: it is 1 if the specific topic that the farmer was trained on is recorded. The name should be has_topic_trained_on, not "has_topic_traine_on".
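
For anyone who wants to sanity-check this reading of the flag, here is a small sketch on toy rows (not the competition data). It assumes the `topics` column is stored as text and is NaN or an empty string when nothing is recorded; that storage format is a guess, not something stated in the data dictionary.

```python
import pandas as pd

# Toy rows for illustration only; NOT the competition data.
# Assumption: `topics` is text, and NaN or "" means no topic recorded.
toy = pd.DataFrame({
    "has_topic_trained_on": [1, 0, 1, 0],
    "topics": ["dairy feeding", None, "poultry health", ""],
})

def flag_mismatches(df, flag_col="has_topic_trained_on", topic_col="topics"):
    """Return rows where the flag disagrees with whether a topic is recorded."""
    has_topic = df[topic_col].fillna("").astype(str).str.strip().ne("")
    return df[has_topic != df[flag_col].astype(bool)]

mismatches = flag_mismatches(toy)
print(len(mismatches))
```

If the real data returns many mismatching rows, the flag probably means something slightly different from this reading, or the topics column uses another format.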

11 Feb 2026, 23:06
Upvotes 1