Hi everyone, I’m curious about how much external datasets are actually helping in this competition. Has anyone managed to achieve a strong score using only the data provided in train.csv? Or are external sources proving essential for competitive results?
I havent tried external datasets yet, but with only the train csv , I have hit a ceiling at 0.31 plb score. I dont know what the 0.4xx and 0.5xx people are doing
did you achieve this with ensemble learning or with pretrained models ?
Indirect answer, but I made a post on the datasets I used, hope it helps.