Primary competition visual

Lacuna Solar Survey Challenge

Helping Madagascar
$5 000 USD
Completed (12 months ago)
Computer Vision
Prediction
729 joined
247 active
Starti
Feb 14, 25
Closei
Mar 23, 25
Reveali
Mar 24, 25
Achieved LB 0.7412 on 3rd Submission – Here's My Approach
Help · 15 Mar 2025, 13:46 · 8

My solution combines multi-modal data fusion and robust training techniques to tackle the solar panel counting challenge. Key components include:

  1. Model ArchitectureBackbone : EfficientNetV2 variant for image feature extraction Metadata Integration : Encoded image origin (D/G) and placement type (roof/ground) via one-hot/dense embeddings Fusion : Concatenated visual features + metadata processed through a 2-layer regression head
  2. Data StrategyCross-Validation : Stratified K-Fold to handle class imbalance Augmentation Pipeline :Dynamic spatial transforms (geometric + color) Targeted dropout patterns to reduce overfitting
  3. Training ProtocolLoss : MAE-focused objective with gradient scaling Optimization : AdamW with cosine LR scheduling Infrastructure : Mixed-precision training for efficiency
  4. Inference Enhancements : Test-time augmentation (TTA) with consistent preprocessing Prediction aggregation from multiple model checkpoints

Validation Insights

  • Achieved steady MAE improvement across epochs (1.25-2.35 range)
  • Metadata integration provided ~8% performance boost vs image-only baseline

This approach balances model capacity, data diversity, and regularization to handle the dataset's unique challenges. Would love to hear about others' strategies for metadata utilization and augmentation design!

Discussion 8 answers
User avatar
KhutsoMphelo
Stellenbosch University

Thank you @zulu40

15 Mar 2025, 13:47
Upvotes 1
User avatar
CodeJoe

Interesting all along I was just training images. Thank you @zulo40. Much appreciated

15 Mar 2025, 13:49
Upvotes 1
User avatar
Koleshjr
Multimedia university of kenya

what's your local Mae for all folds ?

15 Mar 2025, 13:51
Upvotes 1

My Average Val MAE was somewhat near 1.25155

User avatar
Koleshjr
Multimedia university of kenya

nice , thank you for sharing

For future i think i will experiment with Vision Transformers

15 Mar 2025, 14:00
Upvotes 0

The competition has a file-size limit (many Kaggle competitions cap submissions around 20–30 MB). Even if your file has the Slice Master same number of rows, differences in formatting or precision can make it much bigger.

11 Dec 2025, 04:22
Upvotes 0