Outlier Treatment:
My solution greatly benefited from eliminating outliers. I removed target values that were below 20 AGBD and above 300 AGBD.
Feature Engineering:
Cloud Masking:
Both the training and testing images exhibited a significant presence of clouds. To address this issue, I generated a cloud mask and replaced the corresponding pixels in the training and testing images with their mean values.
Vegetation Indices:
I generated a total of 187 distinct vegetation indices. The top 10 vegetation indices, identified through model feature importance, are as follows:
Training and validation sets:
The training data was divided into 25 folds, with the sixth fold utilized as the validation set, and the remaining 24 folds used for training.
Modelling:
The model employed for this solution was Lightgbm, with the objective set to quantile.
Elegant solution. Congrats :)
Elegant one.
Congrats ! I thought luck played an important role in the final LB but you proved me wrong. It seems that preprocessing was the key.
Hats off!