Discussion
Swahili News Dataset is now available in Datasets library from HuggingFace
published 7 Jan 2021, 09:18

Hello community, and happy new year 2021 to everyone! I'm happy to let anyone doing data science or machine learning in NLP know that the Swahili news dataset is now available in the datasets library from HuggingFace (https://github.com/huggingface/datasets). Install the datasets library and access the dataset with 3 lines of code:

from datasets import load_dataset

# download the dataset
swahili_news = load_dataset('swahili_news')

# access the first example of the training split
swahili_news['train'][0]
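
If you want a quick look at what you downloaded, here is a minimal sketch that counts the examples per label and the average article length in a split. It is only an illustration, not part of the datasets library: summarize_split and load_and_summarize are hypothetical helper names, and the field names "text" and "label" are assumptions about how the records are laid out, so check the output of swahili_news['train'][0] and adjust them if needed.

```python
from collections import Counter


def summarize_split(records, label_key="label", text_key="text"):
    """Return (number of records, Counter of label values, average text length).

    `records` is any iterable of dict-like rows. The default key names are
    assumptions about the Swahili news records; change them if the rows
    use different field names.
    """
    count = 0
    labels = Counter()
    total_chars = 0
    for record in records:
        count += 1
        labels[record[label_key]] += 1
        total_chars += len(record[text_key])
    average_length = total_chars / count if count else 0.0
    return count, labels, average_length


def load_and_summarize(split="train"):
    """Download the dataset (needs network access) and summarize one split."""
    # Imported here so summarize_split can be used without the library installed.
    from datasets import load_dataset  # install with: pip install datasets

    swahili_news = load_dataset("swahili_news")
    return summarize_split(swahili_news[split])
```

Calling load_and_summarize() downloads the data on first use and returns the three values for the training split. summarize_split also works on any plain list of dicts, which is handy for trying it out offline.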