Is the 0.995 achieved with F1 or ROC AUC? If it's F1, that's massive. I think the shake-up will be MAD.
It is achieved through F1.
Thanks for your answer. Is it proper cross-validation or just one fold? I'm trying to understand. If it's proper CV, I would trust it if I were you.
It was proper CV with 5 folds, stratified with StratifiedKFold.
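For anyone curious, a stratified 5-fold setup like the one described could be sketched as follows. The dataset and model here are placeholders (not the actual pipeline), just to show the splitting and scoring pattern:

```python
# Minimal sketch of 5-fold StratifiedKFold CV scored with F1.
# Data and model are illustrative stand-ins, not the poster's setup.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score
from sklearn.model_selection import StratifiedKFold

# Imbalanced toy dataset (~90/10 class split)
X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = []
for train_idx, val_idx in skf.split(X, y):
    model = RandomForestClassifier(random_state=0)
    model.fit(X[train_idx], y[train_idx])
    preds = model.predict(X[val_idx])
    scores.append(f1_score(y[val_idx], preds))

print(f"CV F1: {np.mean(scores):.4f} +/- {np.std(scores):.4f}")
```

StratifiedKFold keeps the class ratio roughly the same in every fold, which is why it's a sensible default for imbalanced targets.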
Perhaps investigate the features and confirm there's no unintentional leakage into your val set?
Thanks for your reply! I had already investigated it thoroughly; there is no unintentional leakage.
The test data has Ghana data which is not in your training data. I suspect the large difference comes from the introduction of the Ghana dataset in the test set.
Thanks for replying!! Yeah, you're right, the test data has a lot of irregularity because of Ghana. Let's see what happens! Will have to wait till the deadline!!
I think it's because the public leaderboard that we have now only has 30% of the total dataset. The test dataset also has a different distribution from the train and val distributions. These may result in your score being significantly lower on the public LB but a lot higher on the private LB. Looking at your CV score, I'd suggest you stick to your results and trust your CV. There will definitely be a massive shake-up in the private LB and you'll most likely be at the top.
Thanks for replying! I understand it now. Let's hope for the best!! :))
Did you try feature selection, or go without it?
I tried feature selection!
I'm getting a high score (71) without CV, but low scores with GroupKFold and KFold. For those who are at 80 on the LB, is it feature engineering or model tuning that you are using?
Model tuning won't impact much if you haven't done feature engineering.
Did you address the imbalance with SMOTE?
When I used SMOTE (and feature engineering) I was getting a CV score of 0.996 and a val score of 0.9983.
But when I addressed the imbalance with a different method, the CV changed; I was getting a score of around 0.83.
How did your LB score change with both methods? I always get worse scores with SMOTE so I don't even bother using it.
I tried SMOTE and addressed the imbalance too!
With and without SMOTE the impact on the LB wasn't much, just a 0.01 difference.
Fair enough, when I set sample weights I got a much better score than with SMOTE.
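The sample-weight approach can be sketched like this; the model and data are placeholders, and `compute_sample_weight` with `"balanced"` is one common way to derive the weights:

```python
# Sketch: handling imbalance with sample weights instead of resampling.
# compute_sample_weight("balanced", ...) weights each row inversely to its
# class frequency. Model and dataset are illustrative placeholders.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.utils.class_weight import compute_sample_weight

X, y = make_classification(n_samples=500, weights=[0.9, 0.1], random_state=0)
w = compute_sample_weight(class_weight="balanced", y=y)

model = GradientBoostingClassifier(random_state=0)
model.fit(X, y, sample_weight=w)  # minority rows count more in the loss
```

Unlike SMOTE, this changes the loss rather than the data, so no synthetic samples are created and the evaluation data stays untouched.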
Have you tried folding or grouping techniques?
For cross-validation?
You're using the balanced data for testing. After you balance the data and train the model on it, try to evaluate the model on data that is not balanced; keep it close to the original distribution.
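A minimal sketch of that advice, using simple random oversampling as the balancing step (the data and model are placeholders): balance the training split only, and keep the hold-out at its original class distribution.

```python
# Sketch: balance the TRAINING split only (random oversampling of the
# minority class here) and evaluate on a hold-out that keeps the original
# imbalanced distribution. Data/model are illustrative placeholders.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, stratify=y, random_state=0)

# Oversample the minority class in the training split only
rng = np.random.default_rng(0)
minority = np.flatnonzero(y_tr == 1)
extra = rng.choice(minority, size=(y_tr == 0).sum() - minority.size)
X_bal = np.vstack([X_tr, X_tr[extra]])
y_bal = np.concatenate([y_tr, y_tr[extra]])

model = LogisticRegression(max_iter=1000).fit(X_bal, y_bal)
# y_val still has the original ~90/10 split, so this score is honest
print(f"val F1: {f1_score(y_val, model.predict(X_val)):.3f}")
```

Scoring on a balanced hold-out makes the metric look better than it will be on the real (imbalanced) test data, which is exactly the trap this comment warns about.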