My solution combines multi-modal data fusion and robust training techniques to tackle the solar panel counting challenge. Key components include:
Model Architecture
- Backbone: EfficientNetV2 variant for image feature extraction
- Metadata Integration: encoded image origin (D/G) and placement type (roof/ground) via one-hot/dense embeddings
- Fusion: concatenated visual features + metadata processed through a 2-layer regression head
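The fusion step can be sketched in NumPy as a toy stand-in for the real model: backbone features are concatenated with one-hot metadata and passed through a 2-layer head. The one-hot codings, layer sizes, and function names here are my assumptions, not the author's code.

```python
import numpy as np

def fuse_and_regress(visual_feats, origin, placement, rng):
    """Concatenate backbone features with one-hot metadata and run a
    2-layer regression head (toy NumPy sketch, randomly initialised)."""
    # Hypothetical one-hot codings for the two metadata fields.
    origin_oh = np.array([1.0, 0.0]) if origin == "D" else np.array([0.0, 1.0])
    place_oh = np.array([1.0, 0.0]) if placement == "roof" else np.array([0.0, 1.0])
    x = np.concatenate([visual_feats, origin_oh, place_oh])

    # 2-layer head: hidden ReLU layer, scalar count output.
    w1 = rng.standard_normal((x.size, 16)) * 0.1
    w2 = rng.standard_normal((16, 1)) * 0.1
    hidden = np.maximum(x @ w1, 0.0)   # ReLU
    return float(hidden @ w2)          # predicted panel count (untrained)

rng = np.random.default_rng(0)
feats = rng.standard_normal(8)         # stand-in for backbone output
pred = fuse_and_regress(feats, "D", "roof", rng)
```

In the real model the head weights are of course learned end-to-end with the backbone; the sketch only shows the data flow of the concatenation.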
Data Strategy
- Cross-Validation: Stratified K-Fold to handle class imbalance
- Augmentation Pipeline: dynamic spatial transforms (geometric + color)
- Targeted dropout patterns to reduce overfitting
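One way to stratify a K-Fold split on a count target is to bin the counts into quantile strata and deal each stratum round-robin into folds. This is a sketch under my own assumptions (bin count, fold count, and seed are illustrative, not the author's settings):

```python
import numpy as np

def stratified_fold_ids(counts, n_folds=5, n_bins=4, seed=42):
    """Assign a fold id to each sample so every fold sees a similar
    spread of panel counts (counts binned into quantile strata)."""
    counts = np.asarray(counts)
    # Quantile edges so each bin holds roughly the same number of images.
    edges = np.quantile(counts, np.linspace(0, 1, n_bins + 1)[1:-1])
    bins = np.digitize(counts, edges)

    rng = np.random.default_rng(seed)
    fold = np.empty(len(counts), dtype=int)
    for b in np.unique(bins):
        idx = np.where(bins == b)[0]
        rng.shuffle(idx)
        # Deal shuffled members of this stratum round-robin into folds.
        fold[idx] = np.arange(len(idx)) % n_folds
    return fold

counts = [0, 1, 1, 2, 3, 5, 8, 8, 12, 20, 20, 35]
folds = stratified_fold_ids(counts, n_folds=3)
```

`sklearn.model_selection.StratifiedKFold` does the same job once the target is binned into discrete labels.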
Training Protocol
- Loss: MAE-focused objective with gradient scaling
- Optimization: AdamW with cosine LR scheduling
- Infrastructure: mixed-precision training for efficiency
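The cosine LR schedule paired with AdamW can be written in a few lines; the `base_lr` and `min_lr` values below are placeholder assumptions, not the values used in training:

```python
import math

def cosine_lr(step, total_steps, base_lr=3e-4, min_lr=1e-6):
    """Cosine-annealed learning rate: starts at base_lr and decays
    smoothly to min_lr over total_steps."""
    t = min(step, total_steps) / total_steps
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * t))
```

In PyTorch the same behaviour comes from `torch.optim.lr_scheduler.CosineAnnealingLR` wrapped around an `AdamW` optimizer, with `torch.cuda.amp` (autocast + GradScaler) supplying the mixed-precision and gradient-scaling pieces.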
Inference Enhancements
- Test-time augmentation (TTA) with consistent preprocessing
- Prediction aggregation from multiple model checkpoints
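Combining TTA with checkpoint averaging amounts to a double mean over augmented views and models. A minimal sketch, where toy callables stand in for real checkpoints and flips stand in for the full augmentation set:

```python
import numpy as np

def tta_predict(image, models):
    """Average predictions over horizontal/vertical flips and over
    several model checkpoints (sketch; `models` are callables)."""
    views = [image, np.flip(image, axis=1), np.flip(image, axis=0)]
    preds = [m(v) for m in models for v in views]
    return float(np.mean(preds))

# Two stand-in "checkpoints": both just sum pixel intensities.
models = [lambda img: img.sum(), lambda img: img.sum() * 1.1]
image = np.arange(12.0).reshape(3, 4)
pred = tta_predict(image, models)   # flips don't change a sum, so this
                                    # averages only across the checkpoints
```

The "consistent preprocessing" point matters here: every view must go through the same resize/normalize pipeline as training, or the averaged predictions drift.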
Validation Insights
- MAE improved steadily across epochs, from ~2.35 down to ~1.25
- Metadata integration provided an ~8% performance boost vs the image-only baseline
This approach balances model capacity, data diversity, and regularization to handle the dataset's unique challenges. Would love to hear about others' strategies for metadata utilization and augmentation design!
Thank you @zulu40
Interesting! All along I was just training on the images alone. Thank you @zulo40. Much appreciated
My pleasure
What's your local MAE across all folds?
My average validation MAE across folds was about 1.25155
Nice, thank you for sharing
In the future I think I will experiment with Vision Transformers
The competition has a file-size limit (many Kaggle competitions cap submissions around 20–30 MB). Even if your file has the same number of rows, differences in formatting or precision can make it much bigger.
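To see how precision alone changes file size, here is a sketch that writes the same predictions at full float precision and rounded to three decimals. The column names (`id`, `pred`) are hypothetical:

```python
import csv
import io

# Synthetic predictions with long decimal expansions.
preds = [i * 0.123456789012345 for i in range(1000)]

def to_csv(values, fmt):
    """Write an (id, pred) submission to a string using formatter `fmt`."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["id", "pred"])  # hypothetical header
    for i, v in enumerate(values):
        writer.writerow([i, fmt(v)])
    return buf.getvalue()

full = to_csv(preds, lambda v: repr(v))        # full float precision
rounded = to_csv(preds, lambda v: f"{v:.3f}")  # 3 decimal places
# Same number of rows, but the rounded file is considerably smaller.
```

Rounding to a few decimals is harmless for a count-regression metric like MAE and keeps the submission well under the cap.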