If you are still thinking of participating in the SDG competition, here is a cleaned version of the training set (HTML markup removed and labels extracted): https://github.com/pawelmorawiecki/Zindi_SDG_competition
You will find a new CSV file there, along with the source code used to produce it.
many thanks:)
Thanks...this is helpful.
thanks