Fossil Demand Forecasting Challenge

$5 000 USD

Completed (almost 4 years ago)

Skills you will learn

Forecast

1012 joined

200 active

Start

May 24, 22

Aug 28, 22

Reveal

Aug 28, 22

Jimal

Obafemi awolowo university

Getting errors due to train and test column numbers

Data · 1 Jul 2022, 18:38 · 8

Hello,

The test and train data provided have different numbers of columns. After fitting a machine learning model on the train data, when I try to predict, I get an error saying that the test data (6 columns) has fewer columns than the train data (about 45 columns).
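For anyone hitting the same error, a quick first step is to list which columns the two files share and which exist only in train. This is a minimal sketch using pandas; the tiny frames and the column name `sellin_channel_1` are made up for illustration and are not the actual challenge schema.

```python
import pandas as pd

# Hypothetical miniature frames; the real challenge files have other columns.
train = pd.DataFrame({
    "year": [2021, 2021],
    "month": [1, 2],
    "product": ["A", "A"],
    "sellin": [120, 135],            # target
    "sellin_channel_1": [70, 80],    # illustrative train-only breakdown column
})
test = pd.DataFrame({"year": [2022], "month": [1], "product": ["A"]})

# Columns present in both files: the only ones a model can rely on at test time.
shared = [c for c in train.columns if c in test.columns]

# Columns that exist only in train: the target and anything derived from it.
train_only = [c for c in train.columns if c not in test.columns]

print("shared:", shared)
print("train only:", train_only)
```

A model fitted on every train column will raise a shape or feature-name error at predict time, because the `train_only` columns are absent from test.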

What can I do in this situation, please?

Discussion 8 answers

afrogrit

Hi @Jimal, you need to add the missing columns before applying your model. Recall that the final model expects the same number of input columns you used to build it, and applies its weights to them when predicting. Hope that helps.

1 Jul 2022, 22:04

Upvotes 0

Jimal

Obafemi awolowo university

Thank you for the response.

If I'm to add 30+ columns to the test data, what will the values of each column be? I feel it's not right to come up with random values for each column.

replied to afrogrit · 1 Jul 2022, 22:08

Upvotes 0

afrogrit

I think that's the challenge. I haven't had the time to model on this challenge yet, but I think that's the key.

replied to Jimal · 1 Jul 2022, 22:19

Upvotes 1

Jimal

Obafemi awolowo university

Okay

Thank you

replied to afrogrit · 1 Jul 2022, 22:33

Upvotes 0

flamethrower

Hello Jimal,

This is a forecasting challenge. The majority of the information in train is target-related information, or by-products/breakdowns of the target (sellin). Hence, these are all unknown features at test time. They are provided so that you can build a profile/representation of the distribution pattern for the product of interest, such that given only a month, year and product name, you are able to infer future forecasts. You need to either structure the dataset so that each row of available database information has a corresponding sellin target to be predicted, or apply a time series or sequence-to-sequence model to represent the sellin trend in the dataset.
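To make the restructuring idea concrete, here is a rough sketch, not the approach of any particular entry. It keeps only the keys known at test time (year, month, product) plus the sellin target, builds a simple per-product monthly profile from the history, and joins it onto the test rows. The data, the column name `sellin_channel_1`, and the choice of a plain seasonal average are all assumptions for illustration; a time series or sequence model would take the place of the averaging step.

```python
import pandas as pd

# Hypothetical toy data; real column names and values will differ.
train = pd.DataFrame({
    "year":    [2020, 2020, 2021, 2021, 2020, 2021],
    "month":   [1, 2, 1, 2, 1, 1],
    "product": ["A", "A", "A", "A", "B", "B"],
    "sellin":  [100.0, 110.0, 120.0, 130.0, 50.0, 70.0],
    # Illustrative breakdown of the target: unknown at test time, so not a feature.
    "sellin_channel_1": [60.0, 65.0, 70.0, 80.0, 30.0, 40.0],
})
test = pd.DataFrame({
    "year":    [2022, 2022, 2022],
    "month":   [1, 2, 1],
    "product": ["A", "A", "B"],
})

KEYS = ["year", "month", "product"]   # the only inputs available at test time
TARGET = "sellin"

# Reduce train to the keys plus the target; drop target-derived columns.
history = train[KEYS + [TARGET]]

# Seasonal-average baseline: mean sellin per product and calendar month.
profile = (
    history.groupby(["product", "month"], as_index=False)[TARGET]
    .mean()
    .rename(columns={TARGET: "sellin_forecast"})
)

# A left join keeps every test row, in its original order.
forecast = test.merge(profile, on=["product", "month"], how="left")
print(forecast)
```

With this layout the model input has the same columns in train and test, so the column-count error no longer arises. Products or months missing from the history would get a missing forecast here and need a fallback.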

I hope this helps.

replied to Jimal · 2 Jul 2022, 18:10 (edited 16 days later)

Upvotes 0

Jimal

Obafemi awolowo university

Yes. To an extent. Thank you

replied to flamethrower · 2 Jul 2022, 18:16

Upvotes 0

Jimal

Obafemi awolowo university

Thinking about it again, your advice gave me an idea.

Thank you