In this challenge, we are tasked with two objectives: first, analyzing how features contribute to the target, and second, forecasting the targets. I would like to know if we need to use the same model for both tasks, or if we can use two different models—one to study feature importance on the targets in the training dataset, and another to forecast the clicks using only the time series data of the clicks.
is this approache allowed?
I was wondering the same. Given that they do not provide the features for the target dates, it became a forecasting challenge and those features are relatively useless
I assumed that we could use random forest for feature importance as an example then use the top 5 features to implement a multi-step forecasting model using the five features which can be an LSTM or tree based model.
that would be one way to do it, but what if not using any features at all will give a better score
wait are you using no feature at all😅
yes
16.41180069 with no features??😲oh you magician!!!!
Which forecasting model are u using arima and how or any machine learning model for forecasting,can u give any hints
i am using catboost with a function f(y) for the final prediction. It was mostly luck finding f(y) that would reduce the rmse very well, and i am not sure as i may be overfitting to the 20% so there's that.
How did u created the feature for the test dataset did u used the the forecasted values for the coast , impression or feature engineered new columns like aggregation of the clicks for model
The description suggests that you have to use the same model for both tasks.