
Fossil Demand Forecasting Challenge

$5 000 USD
Completed (over 3 years ago)
Forecast
1009 joined
200 active
Start
May 24, 22
Close
Aug 28, 22
Reveal
Aug 28, 22
Rakesh_Jarupula
National Institute of Technology Silchar
Can anyone explain dataset @Zindi
Data · 8 Jun 2022, 03:20 · 2

Hi @zindi team,

Can we get a little more info about how the data can be used for training a model? The number of features differs between the training and test sets.

Thanks.

Discussion 2 answers

+++

22 Jun 2022, 17:20
Upvotes 0
University of Ilorin

The dataset is a bit tricky, but fairly straightforward once you see how it is set up. The challenge revolves around demand forecasting. The train dataset has up to 30 columns because those columns are there to infer insights from after proper data wrangling. They are not present in the test dataset because it is believed that, based on the knowledge and/or inferences made from train, we only need the month, year and gender to make an accurate prediction. So I suggest that we focus more on the data processing steps. For example, one can combine all the similar features in the train set and divide by the number of months to get an average_per_month selling figure, and so on.
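A rough sketch of that last suggestion, in case it helps. This is only an illustration on a toy table: the column names (sku, year, month, sales_channel_1, sales_channel_2) and the helper average_per_month are hypothetical and are not taken from the actual competition files, so adjust them to whatever the real train set contains.

```python
import pandas as pd

# Toy stand-in for the train set; every column name here is hypothetical.
train = pd.DataFrame({
    "sku": ["A", "A", "A", "B", "B"],
    "year": [2021, 2021, 2021, 2021, 2021],
    "month": [1, 2, 3, 1, 2],
    "sales_channel_1": [10, 20, 30, 5, 15],
    "sales_channel_2": [0, 10, 20, 5, 5],
})


def average_per_month(df, prefix="sales_channel_", key="sku"):
    """Combine similar columns, then divide by the number of months per key."""
    # Pick up all the "similar" features by their shared name prefix.
    similar_cols = [c for c in df.columns if c.startswith(prefix)]

    # Combine them into a single row-wise total.
    work = df.assign(total_sales=df[similar_cols].sum(axis=1))

    # Total over all rows for each key.
    total = work.groupby(key)["total_sales"].sum()

    # Number of distinct (year, month) pairs observed for each key.
    n_months = work.drop_duplicates([key, "year", "month"]).groupby(key).size()

    # Series division aligns on the key index.
    return (total / n_months).rename("average_per_month").reset_index()


features = average_per_month(train)
print(features)
```

The result has one row per sku, so it could then be joined onto the test set on a shared key, giving the test rows a feature derived from columns that only exist in train. Whether that actually helps the score is something you would need to check with validation.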

4 Jul 2022, 20:21
Upvotes 4