Hmmmm, perhaps it simply is what it is, but ... I was expecting the sample submission file to contain values, not just 0 everywhere, for the "other" non-target variables. Actually, I was hoping there would be a "test" file with those values in it. So we have to predict the target "blindly"? No values for any of the other variables during those two weeks to help with the prediction?
Some feature engineering will come in handy. The newly created features will really help in training and predicting for the two weeks.
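To make the feature-engineering suggestion a bit more concrete, here is a minimal sketch of lag and rolling-mean features that only look at values at least two weeks old, so the same features can still be computed for the forecast period. The function name, the 14-day horizon, the 7-day window and the toy series are all assumptions for illustration, not something taken from the competition data.

```python
from datetime import date, timedelta

HORIZON = 14  # assumed forecast horizon in days ("two weeks" in this thread)


def make_lag_features(series, horizon=HORIZON, window=7):
    """Build one feature row per day from values at least `horizon` days old.

    `series` is a list of (date, value) pairs, sorted by date, one per day.
    """
    values = [v for _, v in series]
    rows = []
    for t in range(horizon + window - 1, len(series)):
        # Index of the newest value we may use when predicting day t.
        last_known = t - horizon
        recent = values[last_known - window + 1 : last_known + 1]
        rows.append({
            "date": series[t][0],
            "lag_h": values[last_known],          # value `horizon` days back
            "roll_mean": sum(recent) / window,    # mean of the window ending there
            "day_of_week": series[t][0].weekday(),  # Monday = 0
            "target": values[t],
        })
    return rows


# Toy daily series (hypothetical data): 30 days with values 0.0 .. 29.0.
start = date(2024, 1, 1)
toy = [(start + timedelta(days=i), float(i)) for i in range(30)]
features = make_lag_features(toy)
```

The point of lagging by the full horizon is that nothing newer than two weeks before the target day leaks into a row, which matches the situation at prediction time.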
Great point. On top of that, a simple "starter notebook" would go a long way toward clearing up the doubts that participants have.
Yip ...
Here is a snippet that I found useful - if you run this, you will get the same labels as the files provided.
mod_col = f"{base}_{cell}_{beam}"

Also, to pivot the sample submission (here I call it ss) I use this ... but of course, it ends up with just zeroes ...
EDIT: I'm fighting the editing system over here ... also, there was an important omission here that I have now added in.
Anyhow, this is sort of the point of my post: I was hoping to do this and end up with a bunch of explanatory variables extracted from the sample submission to model the target variables with, but it seems we just predict into the unseen without any additional variables. So I guess it is what it is ...
And then ... to create files similar to the input files but from the sample submission
x[mod_cols].to_csv(f"../output/{i}={j}.csv")

Speaking as someone who does time-series predictions in the real world ... this is often what you would actually get. Sure, sometimes you can back out some day of week or add some future holidays if you have data. But usually we don't know any of the X's or Y's for two weeks from now.
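The day-of-week and holiday idea is one of the few things that really is known ahead of time, so a rough sketch may help. The function name, the dates and the holiday set below are hypothetical placeholders; the actual forecast window and holiday calendar would have to come from the competition data and the relevant country.

```python
from datetime import date, timedelta


def future_calendar_features(last_train_day, horizon=14, holidays=frozenset()):
    """Return one row of calendar features per future day in the horizon."""
    rows = []
    for step in range(1, horizon + 1):
        day = last_train_day + timedelta(days=step)
        rows.append({
            "date": day,
            "day_of_week": day.weekday(),      # Monday = 0 ... Sunday = 6
            "is_weekend": day.weekday() >= 5,
            "is_holiday": day in holidays,
        })
    return rows


# Hypothetical example: training data ends on 2023-12-24, one known holiday.
known_holidays = frozenset({date(2024, 1, 1)})
future = future_calendar_features(date(2023, 12, 24), holidays=known_holidays)
```

These rows can be built for the two unseen weeks without any measured values, which is why they are about the only "X" available at prediction time.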
@Skaak
I tend to concur, the ds is very rough and rudimentary. Getting it together and into a train / test format is a mean task, unless you are the Optimus Prime of DS. Also, no starter webinar or starter notebook!
Jaw!
So nice to see you - how are you doing my good friend? Did you play any of the fossil comps? This reminds me of those. Also a bit like Sasol. Oh well ... as CB says, it is what it is.
I think, if this was a real world one, you'd struggle to beat ARIMA ... just work the stuff through that and you have decent forecasts and a relatively simple model that scales well to this sort of a thing. But ... to win a comp, you have to dig deeper I guess.
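For anyone who wants to see what the ARIMA baseline amounts to, here is a deliberately tiny, hand-rolled sketch of an ARIMA(1,1,0) model without a constant, fitted by least squares on the first differences. This is a simplified stand-in written for illustration, not a library implementation: in practice you would more likely reach for something like the `ARIMA` class in `statsmodels.tsa.arima.model` and choose the order properly. The function names and the toy history are assumptions.

```python
def fit_arima_110(y):
    """Estimate phi in  dy[t] = phi * dy[t-1] + noise  by least squares."""
    d = [b - a for a, b in zip(y, y[1:])]  # first differences of the series
    num = sum(cur * prev for prev, cur in zip(d, d[1:]))
    den = sum(prev * prev for prev in d[:-1])
    return num / den if den else 0.0


def forecast_arima_110(y, phi, horizon=14):
    """Roll the fitted difference equation forward `horizon` steps."""
    last_level = y[-1]
    last_diff = y[-1] - y[-2]
    out = []
    for _ in range(horizon):
        last_diff = phi * last_diff          # expected next difference
        last_level = last_level + last_diff  # integrate back to the level
        out.append(last_level)
    return out


# Toy history (hypothetical): differences halve each step, so phi comes out as 0.5.
history = [10.0, 12.0, 13.0, 13.5, 13.75, 13.875]
phi = fit_arima_110(history)
forecast = forecast_arima_110(history, phi, horizon=14)
```

Run per series, something like this gives a 14-step forecast from the history alone, which is the "no extra variables" setting described in this thread.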
Best wishes for this one ... you are the shark of DS of course and, afaik, time series are your thing. Hope you do really well here.