Primary competition visual

Ghana’s Indigenous Intel Challenge [BEGINNERS ONLY]

Helping Ghana, Algeria
and 53 other countries
  • Ghana
  • Algeria
  • Angola
  • Benin
  • Botswana
  • Burkina Faso
  • Burundi
  • Cameroon
  • Cabo Verde
  • Central African Republic
  • Chad
  • Comoros
  • Congo (Republic of the)
  • Congo (Democratic Republic of the)
  • Djibouti
  • Egypt
  • Equatorial Guinea
  • Eritrea
  • Eswatini
  • Ethiopia
  • Gabon
  • Gambia
  • Guinea
  • Guinea-Bissau
  • CĂ´te d'Ivoire
  • Kenya
  • Lesotho
  • Liberia
  • Libya
  • Madagascar
  • Malawi
  • Mali
  • Mauritania
  • Mauritius
  • Morocco
  • Mozambique
  • Namibia
  • Niger
  • Nigeria
  • Rwanda
  • Sao Tome and Principe
  • Senegal
  • Seychelles
  • Sierra Leone
  • Somalia
  • South Sudan
  • South Africa
  • Sudan
  • Tanzania
  • United Republic of
  • Togo
  • Tunisia
  • Uganda
  • Zambia
  • Zimbabwe
  • Scroll to see more
$2 500 USD
Challenge completed ~2 months ago
Prediction
910 joined
565 active
Starti
Aug 14, 25
Closei
Oct 12, 25
Reveali
Oct 12, 25
User avatar
Mithamo_Morgan
user_id
4 Sep 2025, 02:21 · 2

Can including user_id as a feature in my model cause data leakage?

Discussion 2 answers
User avatar
Satti_Tareq

I do not think so since there is a temporal shift between train and test, but it can be a good base for aggregation features. and since some users are only in the test set a proper cv startegy will be by holding some users to validate on so you get to mimic the train test split on predicting on unseen users.

5 Sep 2025, 05:25
Upvotes 1
User avatar
Mithamo_Morgan

Thanks Satti for this insightful idea.