
AirQo African Air Quality Prediction Challenge

$3 000 USD
Completed (over 1 year ago)
Prediction
1029 joined
514 active
Start: Mar 15, 24
Close: Jun 16, 24
Reveal: Jun 16, 24
Yisakberhanu
wachemo university
Small Data, Big Problems?
Data · 16 May 2024, 09:04 · 4
  • Small Data, Big Problems? Limited training data can make models overfit, meaning they perform well on the competition set but struggle in the real world. A single outlier in the test data could determine the winner, which is not ideal!
  • Validation! Validating models with small datasets is tough. The test data may cover different locations or scenarios than the training data, making it hard to assess how well a model generalizes. With RMSE as the metric in particular, outliers can have a big impact.
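
One way to approximate the "different locations" problem locally is to hold out whole locations during validation rather than random rows. The sketch below is only an illustration under assumptions: the `sites` labels are made up, and the helper name `leave_one_group_out` is hypothetical, not something from the competition starter code.

```python
from collections import defaultdict


def leave_one_group_out(groups):
    """Yield (held_out_group, train_idx, val_idx), holding out one group per fold."""
    by_group = defaultdict(list)
    for idx, g in enumerate(groups):
        by_group[g].append(idx)
    for g in sorted(by_group):
        val_idx = by_group[g]
        # Every row that does not belong to the held-out group is used for training.
        train_idx = [i for i in range(len(groups)) if groups[i] != g]
        yield g, train_idx, val_idx


# Hypothetical site labels for six rows; not real competition data.
sites = ["A", "A", "B", "B", "C", "C"]
folds = list(leave_one_group_out(sites))
for held_out, train_idx, val_idx in folds:
    print(held_out, train_idx, val_idx)
```

Each fold scores the model on a location it never saw in training, which is closer to the situation described above than a random row split. With very few locations the fold scores will still be noisy.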
Discussion 4 answers
marching_learning
Nostalgic Mathematics

Yes, I call on the authors to change the metric to MAE, because as it stands the winner will be whichever model is lucky enough to perform best on the outliers.
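
To make the point concrete, here is a small sketch with made-up numbers (not competition data) where two hypothetical models swap places depending on the metric. Model A is exact on the typical rows but misses the outlier badly. Model B is worse on every typical row but somewhat closer on the outlier.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error: squaring makes large errors dominate."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def mae(y_true, y_pred):
    """Mean absolute error: every unit of error counts the same."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up targets; the last value plays the role of an outlier.
y_true = [10.0, 12.0, 11.0, 13.0, 100.0]
# Model A: exact on the typical rows, far off on the outlier.
pred_a = [10.0, 12.0, 11.0, 13.0, 20.0]
# Model B: off by 8 on every typical row, closer on the outlier.
pred_b = [18.0, 20.0, 19.0, 21.0, 40.0]

for name, pred in [("A", pred_a), ("B", pred_b)]:
    print(name, "RMSE:", round(rmse(y_true, pred), 2), "MAE:", round(mae(y_true, pred), 2))
```

Under MAE model A scores better, while under RMSE model B does, purely because of the single outlier row.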

16 May 2024, 09:18
Upvotes 1
Yisakberhanu
wachemo university

You know, Zindi is fond of RMSE.

I came to the same conclusion.

16 May 2024, 17:41
Upvotes 1
Yisakberhanu
wachemo university

I hope the best solution does not end up in the trash.