Zindi User Behaviour Birthday Challenge

Helping Africa
$3 000 USD
Completed (~4 years ago)
Prediction
871 joined
174 active
Start: Sep 24, 21
Close: Jan 23, 22
Reveal: Jan 23, 22
DATA PREPROCESSING, MODEL SELECTION
Help · 2 Dec 2021, 03:38 · 3

Can anyone help me with the data preprocessing techniques required for the available CSV files, and with which model should be selected?

Discussion 3 answers
ff
University of Yaoundé I

Hello @sky_179,

Of all the given datasets, I only used user.csv. I applied the usual preprocessing steps. To deal with the class imbalance, I split the data into multiple balanced subsets. Then I trained LGBM on .
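
For anyone who wants to try the balanced-subset idea, here is a minimal sketch. It is only one way to read the approach above and not necessarily what was actually done: it assumes a binary 0/1 target, and the names `balanced_subsets` and `train_ensemble` are made up for illustration. Each subset keeps every minority row and pairs it with a different, equally sized chunk of majority rows, and one LightGBM model is fitted per subset.

```python
import random


def balanced_subsets(labels, seed=0):
    """Return lists of row indices; each list holds all minority rows
    plus an equally sized, disjoint chunk of majority rows."""
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    if not minority:
        raise ValueError("both classes must be present")

    majority = majority[:]
    random.Random(seed).shuffle(majority)

    size = len(minority)
    subsets = []
    # Walk the shuffled majority rows in chunks of the minority size.
    # A short final chunk is dropped so that every subset stays balanced.
    for start in range(0, len(majority) - size + 1, size):
        subsets.append(sorted(minority + majority[start:start + size]))
    return subsets


def train_ensemble(X, y, seed=0):
    """Fit one LightGBM classifier per balanced subset.

    Assumes X is a pandas DataFrame of features and y a sequence of 0/1
    labels in the same row order. Requires the lightgbm package.
    """
    from lightgbm import LGBMClassifier  # imported lazily on purpose

    y = list(y)
    models = []
    for idx in balanced_subsets(y, seed):
        model = LGBMClassifier(random_state=seed)
        model.fit(X.iloc[idx], [y[i] for i in idx])
        models.append(model)
    return models
```

At prediction time, one option is to average `predict_proba` over the fitted models, so that every majority row that landed in a chunk still contributes to the final score.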

2 Dec 2021, 11:02
Upvotes 0

Don't you think there will be data leakage?

ff
University of Yaoundé I

No. I'm only using Users.csv because I couldn't find any ID_User from Test.csv in the other datasets. So, apart from User.csv, the other datasets do not contain any information about the users in Test.csv.
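
The overlap check described above can be reproduced with a few lines of standard-library Python. This is a rough sketch: it assumes both files have a column literally named `ID_User`, and the helper names `read_ids` and `shared_ids` are made up for illustration.

```python
import csv


def read_ids(csv_file, column="ID_User"):
    """Collect the distinct values of `column` from an open CSV file."""
    return {row[column] for row in csv.DictReader(csv_file)}


def shared_ids(test_file, other_file, column="ID_User"):
    """Return the IDs that appear in both files (empty set if none)."""
    return read_ids(test_file, column) & read_ids(other_file, column)


# Example usage (the second file name is a placeholder):
# with open("Test.csv", newline="") as t, open("other.csv", newline="") as o:
#     print(len(shared_ids(t, o)))
```

An empty result for every auxiliary file would match the observation that those files carry no rows for the test users.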