AI4D - African Language Dataset Challenge

Kathleen Siminyu, Sackey Freshia


Abstract
As language and speech technologies become more advanced, the lack of fundamental digital resources for African languages, such as data, spell checkers and PoS taggers, means that the digital divide between these languages and others keeps growing. This work details the organisation of the AI4D - African Language Dataset Challenge, an effort to incentivize the creation, curation and uncovering to African language datasets through a competitive challenge, particularly datasets that are annotated or prepared for use in a downstream NLP task.
Anthology ID:
2020.winlp-1.18
Volume:
Proceedings of the Fourth Widening Natural Language Processing Workshop
Month:
July
Year:
2020
Address:
Seattle, USA
Editors:
Rossana Cunha, Samira Shaikh, Erika Varis, Ryan Georgi, Alicia Tsai, Antonios Anastasopoulos, Khyathi Raghavi Chandu
Venue:
WiNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
68–77
Language:
URL:
https://aclanthology.org/2020.winlp-1.18
DOI:
10.18653/v1/2020.winlp-1.18
Bibkey:
Cite (ACL):
Kathleen Siminyu and Sackey Freshia. 2020. AI4D - African Language Dataset Challenge. In Proceedings of the Fourth Widening Natural Language Processing Workshop, pages 68–77, Seattle, USA. Association for Computational Linguistics.
Cite (Informal):
AI4D - African Language Dataset Challenge (Siminyu & Freshia, WiNLP 2020)
Copy Citation:
Video:
 http://slideslive.com/38929555