🤝 AI in Focus: Our Solution

Fossil Demand Forecasting Challenge

$5 000 USD

Completed (almost 4 years ago)

Skills you will learn

Forecast

1012 joined

200 active

Info Data Chat Leaderboard

Start

May 24, 22

Aug 28, 22

Reveal

Aug 28, 22

Mohamed-Eltayeb

Our Solution

Notebooks · 1 Sep 2022, 09:01 · 6

I was waiting for the top 3 to share their solutions, but sadly no one did.

Btw, for anyone interested here is part of our solution:

Projects_Portfolio/[Zindi] Fossil Demand Forecasting Challenge - Regression at main · mohammad2012191/Projects_Portfolio (github.com)

This is not the final one, but it's very similar to it. This would give a score about 165000.

Please put a star if you found it useful.

Discussion 6 answers

thatdataanalyst

thanks for sharing!!

I was wondering how you checked your model performances. Your notebook doesn't seem to include cross validation part. Does this mean, you didn't check your perfomance before submission? maybe check how the model improves on public leaderboard?

I want to ask one more thing... is Catboost the only model in your final solution too? If yes, why did you not try any other popular models, such as lightgbm, xgb...

thanks in advance,

1 Sep 2022, 11:08

Upvotes 0

Mohamed-Eltayeb

The cross validation part was a bit messy in my notebook, so I deleted it from this notebook. But, I just used the last 4 months as a validation set, and the rest of data as a training set.

Tbh, I didn't have a lot of time as that I entered the competition a little bit late. So I just stucked with catboost and foucs on feature engineering.

replied to thatdataanalyst1 Sep 2022, 11:34

Upvotes 2

thatdataanalyst

thanks for your reply...!

i see your point.

I asked question because i wanted to know how other participants conducted validation. tbh i stuggled a lot to appropriately validate my model. also my CV score didn't indicate leaderbord score well...

by the way, did your CV score has good correlation against submission score (public/private)?

replied to Mohamed-Eltayeb1 Sep 2022, 11:42

Upvotes 0

Mohamed-Eltayeb

Yes, it had a very good correlation with the lb. When forecasting just make sure you make your validation period the same as the test set (4 months) and avoid adding features lagged less than this period.

Also the train set must represent the past values, and the validation set must represent the future.

replied to thatdataanalyst1 Sep 2022, 11:46

Upvotes 2

anasr

Hey Mohamed, thank you for sharing, I also went for a catboost :). I tried other models (RF, Arima, lgbm, xgb, MLP) but when I checked the performance using cross validation, the performance fluctuated a lot so I went for a catboost.. at least the score was stable through the testing sets and it will perform well in the future I hope :)

It's my first competition on zindi, can we share the code before/after the final leaderbord ? If a zindi staff member can tell me I would be happy to share too !

3 Sep 2022, 22:48

Upvotes 4

Mohamed-Eltayeb

Well Done! Getting the 2nd place in your first competition is amazing!

I think you can share a brief summary about your approach whatever is your place in the lb. Not sure about the code though especially if you were in the prize area.

replied to anasr4 Sep 2022, 09:36

Upvotes 1

Join the largest network for
data scientists and AI builders

About FAQs

Status