arXiv Open Access 2023

HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis

Saheed Abdullahi Salahudeen Falalu Ibrahim Lawan Ahmad Mustapha Wali Amina Abubakar Imam Aliyu Rabiu Shuaibu +10 lainnya
Lihat Sumber

Abstrak

We present the findings of SemEval-2023 Task 12, a shared task on sentiment analysis for low-resource African languages using Twitter dataset. The task featured three subtasks; subtask A is monolingual sentiment classification with 12 tracks which are all monolingual languages, subtask B is multilingual sentiment classification using the tracks in subtask A and subtask C is a zero-shot sentiment classification. We present the results and findings of subtask A, subtask B and subtask C. We also release the code on github. Our goal is to leverage low-resource tweet data using pre-trained Afro-xlmr-large, AfriBERTa-Large, Bert-base-arabic-camelbert-da-sentiment (Arabic-camelbert), Multilingual-BERT (mBERT) and BERT models for sentiment analysis of 14 African languages. The datasets for these subtasks consists of a gold standard multi-class labeled Twitter datasets from these languages. Our results demonstrate that Afro-xlmr-large model performed better compared to the other models in most of the languages datasets. Similarly, Nigerian languages: Hausa, Igbo, and Yoruba achieved better performance compared to other languages and this can be attributed to the higher volume of data present in the languages.

Topik & Kata Kunci

Penulis (15)

S

Saheed Abdullahi Salahudeen

F

Falalu Ibrahim Lawan

A

Ahmad Mustapha Wali

A

Amina Abubakar Imam

A

Aliyu Rabiu Shuaibu

A

Aliyu Yusuf

N

Nur Bala Rabiu

M

Musa Bello

S

Shamsuddeen Umaru Adamu

S

Saminu Mohammad Aliyu

M

Murja Sani Gadanya

S

Sanah Abdullahi Muaz

M

Mahmoud Said Ahmad

A

Abdulkadir Abdullahi

A

Abdulmalik Yusuf Jamoh

Format Sitasi

Salahudeen, S.A., Lawan, F.I., Wali, A.M., Imam, A.A., Shuaibu, A.R., Yusuf, A. et al. (2023). HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis. https://arxiv.org/abs/2304.13634

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓