I've noticed that there are several identical scores "112.4561452" on the list. I'm wondering if this is due to the data. I've been trying for a while now, but I'm still around 110 and haven't seen much improvement.
Maybe you are not considering the ID on train and test.
Separate the date from the id in the sample, and consider it by applying the same label encoder on train and test and train an LGBM you will easily reach 75.
A cautionary note...When using ID that appear in both the train and test (which is actually the ID in the sample submission file as there are no test set, unless of cause you split the train set and developed a test set), are u not guilty of committing a special form of 'LEAKAGE', in particular I believe it constitute a case of train-test contamination as data snooping occur through the use of ID variable, so if the same 'ID' that is in the train set is also in the test set(sample submission)...a cautionary note!!! So you will climb up the leader board only to be penalised later for leakage and overfitting....just sharing a thought!!!
112.45 is what you get if your submission is only zeros clicks per test sample
Focus, just like in real life, look out for shade(hidden anomolies and gremlins)!!!
Whenever I feel that I hit the wall, I start over from scratch.
Maybe you are not considering the ID on train and test.
Separate the date from the id in the sample, and consider it by applying the same label encoder on train and test and train an LGBM you will easily reach 75.
A cautionary note...When using ID that appear in both the train and test (which is actually the ID in the sample submission file as there are no test set, unless of cause you split the train set and developed a test set), are u not guilty of committing a special form of 'LEAKAGE', in particular I believe it constitute a case of train-test contamination as data snooping occur through the use of ID variable, so if the same 'ID' that is in the train set is also in the test set(sample submission)...a cautionary note!!! So you will climb up the leader board only to be penalised later for leakage and overfitting....just sharing a thought!!!