arXiv Open Access 2025

Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead

Jesujoba O. Alabi Michael A. Hedderich David Ifeoluwa Adelani Dietrich Klakow
Lihat Sumber

Abstrak

With over 2,000 languages and potentially millions of speakers, Africa represents one of the richest linguistic regions in the world. Yet, this diversity is scarcely reflected in state-of-the-art natural language processing (NLP) systems and large language models (LLMs), which predominantly support a narrow set of high-resource languages. This exclusion not only limits the reach and utility of modern NLP technologies but also risks widening the digital divide across linguistic communities. Nevertheless, NLP research on African languages is active and growing. In recent years, there has been a surge of interest in this area, driven by several factors-including the creation of multilingual language resources, the rise of community-led initiatives, and increased support through funding programs. In this survey, we analyze 884 research papers on NLP for African languages published over the past five years, offering a comprehensive overview of recent progress across core tasks. We identify key trends shaping the field and conclude by outlining promising directions to foster more inclusive and sustainable NLP research for African languages.

Topik & Kata Kunci

Penulis (4)

J

Jesujoba O. Alabi

M

Michael A. Hedderich

D

David Ifeoluwa Adelani

D

Dietrich Klakow

Format Sitasi

Alabi, J.O., Hedderich, M.A., Adelani, D.I., Klakow, D. (2025). Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead. https://arxiv.org/abs/2505.21315

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓