Primary competition visual

Telangana Crop Health Challenge

Helping India
€6 900 EUR
Completed (~1 year ago)
Classification
1073 joined
285 active
Starti
Nov 08, 24
Closei
Feb 09, 25
Reveali
Feb 10, 25
The "District", "Sub-District", "CLast" columns in the test set contains entries which are not present in the train set
Data · 9 Jan 2025, 00:15 · 2

Hi everyone, after some exploratory data analysis, I have found the followings:

- The "District" column in the test set contains 'Kumurambheem Asifabad', 'Sangareddy' which are not present in the train set

- The "Sub-District" column in the test set contains

'Adavidevulapally', 'Addakal', 'Adilabad Urban', 'Bibipet', 'Chinnagudur', 'Doulthabad', 'Gandeed', 'Gundlapally', 'Gundmal', 'Jadcherla', 'Jainoor', 'Kerameri', 'Kesamudram', 'Kethepally', 'Kuntala', 'Maddur', 'Mahabubabad', 'Malegaon', 'Marriguda', 'Munugode', 'Nakrekal', 'Nampally', 'Narnoor', 'Neredugommu', 'Nirmal Rural', 'Nizampet', 'Papannapet', 'Parvathagiri', 'Ponkal', 'Sarangapur', 'Sathnala', 'Talamadugu', 'Tekmal', 'Toopran', 'Vatpally'

which are not present in the train set

- The "CNext" column in the test set contains "cotton" which are not present in the train set, although this column is probably related to the "CLast" column, so I'm not sure if this counts or not.

Do you guys think this might have an effect on the results ? Thank you.

Discussion 2 answers

@Zindi Can you please take a look, thank you.

10 Jan 2025, 10:42
Upvotes 0
User avatar
Koleshjr
Multimedia university of kenya

it's intentional, Your cv should reflect that