I originally thought that the same event_ids would be present in the train and test sets, however upon further inspection this does not seem to be the case. Is it safe to assume that our model is not expected to predict based on the location of observed precipitation? So we're only concerned with precipitation trends across all locations on the same day. I'm not too experienced with data science and AI/ML so I'm not sure how we could train a model based on one set of locations and then have it predict on a different set of unseen locations, if that makes sense. Thanks!
I believe the goal intended by the organizers is to have a model that, given a place with certain morphological characteristics, and a sequence of precipitation data, is able to generally assess if and when a flooding event will happen. The model learns from some location images with the relative precipitation time series (that could be registered at different times of the year and in different years, so not always in the same days) and predicts a flooding event for a new location, with new precipitation data. Quite ambitious task with the scarce data provided, but yeah. Hope I answered your question