In the orders data, there are multiple values for the same CID X LOC_NUM X VENDOR. Should we keep the multiple rows considering they are different orders or keep one and delete the rest? How do I work around this?
there is no wrong answer!
It will be great if you try them both.
Within the second approach, you will maybe face an overfitting problem (try it).
there is no wrong answer!
It will be great if you try them both.
Within the second approach, you will maybe face an overfitting problem (try it).