arXiv Open Access 2025

Analysis of LLM as a grammatical feature tagger for African American English

Rahul Porwal Alice Rozet Pryce Houck Jotsna Gowda Sarah Moeller +1 lainnya
Lihat Sumber

Abstrak

African American English (AAE) presents unique challenges in natural language processing (NLP). This research systematically compares the performance of available NLP models--rule-based, transformer-based, and large language models (LLMs)--capable of identifying key grammatical features of AAE, namely Habitual Be and Multiple Negation. These features were selected for their distinct grammatical complexity and frequency of occurrence. The evaluation involved sentence-level binary classification tasks, using both zero-shot and few-shot strategies. The analysis reveals that while LLMs show promise compared to the baseline, they are influenced by biases such as recency and unrelated features in the text such as formality. This study highlights the necessity for improved model training and architectural adjustments to better accommodate AAE's unique linguistic characteristics. Data and code are available.

Topik & Kata Kunci

Penulis (6)

R

Rahul Porwal

A

Alice Rozet

P

Pryce Houck

J

Jotsna Gowda

S

Sarah Moeller

K

Kevin Tang

Format Sitasi

Porwal, R., Rozet, A., Houck, P., Gowda, J., Moeller, S., Tang, K. (2025). Analysis of LLM as a grammatical feature tagger for African American English. https://arxiv.org/abs/2502.06004

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓