Primary competition visual

IBM SkillsBuild Hydropower Climate Optimisation Challenge

Helping the World
$3 000 USD
Completed (~1 year ago)
Prediction
Forecast
1233 joined
462 active
Starti
Mar 03, 25
Closei
Apr 13, 25
Reveali
Apr 14, 25
User avatar
100i
Ghana Health Service
On 'kwh' train/test distribution
Help ยท 2 Apr 2025, 19:23 ยท 6

Has anyone tried tweedie as regression objective (irrespective of model type or CV strategy)?

It appears tweedie regression demonstrates strong perf on local cv but not great on public LB.

Anyway, it's still a struggle attempting to harmonize my local cv and public LB, but I'm curious to know if anyone also noticed this trend.

I made this observation whilst experimenting with xgb and lgbm. Happy to know your thoughts.

Discussion 6 answers
User avatar
CodeJoe

Yes very poor actually. But good cv. Tweedie gave me like 2 RMSE score on CV but 11 on lb

2 Apr 2025, 21:17
Upvotes 2
User avatar
100i
Ghana Health Service

I see... I got like 1.2 rmse cv but 13.5 on lb

User avatar
CodeJoe

I am not sure that's a great option though.

User avatar
100i
Ghana Health Service

I guess so

User avatar
Krishna_Priya

Tweedie is meant for 0 inflated distributions. The training data has this distribution however the test has nowhere close to the distribution you observe in train.

why do i hypothesise that?

- lookat the target shift between last 31 days in training and the data prior to that. my best guess is that in test we would have a distribution similar to the last 31 days in training if not more :)

3 Apr 2025, 03:18
Upvotes 3
User avatar
100i
Ghana Health Service

I see your perspective. I need to look at the data from that angle. Thanks for your insights.