Primary competition visual

IBM SkillsBuild Hydropower Climate Optimisation Challenge

Helping the World
$3 000 USD
Completed (12 months ago)
Prediction
Forecast
1231 joined
466 active
Starti
Mar 03, 25
Closei
Apr 13, 25
Reveali
Apr 14, 25
User avatar
data_style_bender
First Place Solution
15 Apr 2025, 14:44 · 12

🧹 Data Preprocessing & Cleaning

  • Excluded devices not present in the test set to prevent leakage and noise
  • Aggregated 5-minute data into daily features with comprehensive statistics for voltage, current, and power factors
  • Identified and filtered offline periods by detecting zero-consumption weeks
  • Focused exclusively on online days to better match test distribution patterns

🌦️ Feature Engineering (Comprehensive Climate Integration)

  • Climate data integration: Incorporated detailed statistics of temperature, dewpoint, precipitation, snowfall, wind components, and snow cover
  • Temporal cyclical encodings: Created sine/cosine transformations for day of week, day of year, month, and week to handle cyclical patterns
  • Pakistan-specific cultural features: Added holiday flags, Ramadan period indicators, and seasonal delineations specific to Kalam region
  • Temperature trend analysis: Generated temperature acceleration, volatility measures, ewm averages, and extreme temperature indicators
  • Heating/cooling indicators: Calculated heating and cooling degree days (base 18°C), temperature-dewpoint differences, and day-in-season positions
  • Weather interactions: Modeled interactions between temperature and weekday, creating specialized features for each day of the week

🧪 Strategic Data Segmentation

  • Divided the dataset into four carefully selected temporal segments:
  • Data1: Late summer/early fall (August-September 2024 and October 2023)
  • Data2: Winter and mid-summer (November-December 2023 and July 2024)
  • Data3: Remaining periods with distinct consumption patterns
  • Data4: Complete dataset for a robust global model

🧠 Advanced Ensemble Modeling

  • Multi-configuration approach: Trained 7 different LGBM configurations per segment:
  • Precise (conservative with deep trees)
  • Feature-selective (aggressive feature selection)
  • Robust (focused on outlier resistance)
  • Deep forest (very deep trees with many estimators)
  • Highly regularized (to prevent overfitting)
  • Fast learner (high learning rate for quick convergence)
  • Balanced (optimized bias-variance tradeoff)
  • Bayesian optimization for weights: Used Bayesian optimization to find optimal weights for combining base models instead of a simple meta-model
  • K-fold validation: Implemented 5-fold cross-validation with ensemble weight optimization per fold
  • Multi-level ensemble: Combined segment-specific models with a sophisticated weighting scheme
Discussion 12 answers

Can you attach the notebook, please?

15 Apr 2025, 14:49
Upvotes 0
User avatar
data_style_bender

when the code review is complete

15 Apr 2025, 14:51
Upvotes 0

Are you not on the good to share the notebook now since the winners have been announced? Well done again!

User avatar
100i
Ghana Health Service

Congratulations on your victory! impressive feature engineering capturing important granularities in the data. Can't wait to see the code

15 Apr 2025, 15:56
Upvotes 0
User avatar
Knowledge_Seeker101
Freelance

Congratulations 🎉 , nic work,I like modelling

15 Apr 2025, 17:08
Upvotes 0

How did you think of these seven different configurations, and why seven?

15 Apr 2025, 18:46
Upvotes 0
User avatar
data_style_bender

Firstly I tried one LGBM model in my 5 fold cross validation, which was good , I then tried 3 which was better and then 7... due to deadline, I didn't try more.. but I am sure mixing with other models might even outperform

Nice work!! Congratulation

16 Apr 2025, 14:40
Upvotes 0
User avatar
Koleshjr
Multimedia university of kenya

What was your score for your best single model?

17 Apr 2025, 09:28
Upvotes 0

Are you not on the good to share the notebook now since the winners have been announced? Well done again!

17 Apr 2025, 10:31
Upvotes 0

Great work. Congratulations

5 Jun 2025, 09:54
Upvotes 0