arXiv Open Access 2025

Analysis of LLM as a grammatical feature tagger for African American English

Rahul Porwal Alice Rozet Pryce Houck Jotsna Gowda Sarah Moeller +1 lainnya

Lihat Sumber

Abstrak

African American English (AAE) presents unique challenges in natural language processing (NLP). This research systematically compares the performance of available NLP models--rule-based, transformer-based, and large language models (LLMs)--capable of identifying key grammatical features of AAE, namely Habitual Be and Multiple Negation. These features were selected for their distinct grammatical complexity and frequency of occurrence. The evaluation involved sentence-level binary classification tasks, using both zero-shot and few-shot strategies. The analysis reveals that while LLMs show promise compared to the baseline, they are influenced by biases such as recency and unrelated features in the text such as formality. This study highlights the necessity for improved model training and architectural adjustments to better accommodate AAE's unique linguistic characteristics. Data and code are available.

Topik & Kata Kunci

cs.CL cs.AI cs.LG

Penulis (6)

Rahul Porwal

Alice Rozet

Pryce Houck

Jotsna Gowda

Sarah Moeller

Kevin Tang

Format Sitasi

APA MLA BibTeX

Porwal, R., Rozet, A., Houck, P., Gowda, J., Moeller, S., Tang, K. (2025). Analysis of LLM as a grammatical feature tagger for African American English. https://arxiv.org/abs/2502.06004

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓