arXiv Open Access 2024

Investigating Annotator Bias in Large Language Models for Hate Speech Detection

Amit Das Zheng Zhang Najib Hasan Souvika Sarkar Fatemeh Jamshidi +10 lainnya

Lihat Sumber

Abstrak

Data annotation, the practice of assigning descriptive labels to raw data, is pivotal in optimizing the performance of machine learning models. However, it is a resource-intensive process susceptible to biases introduced by annotators. The emergence of sophisticated Large Language Models (LLMs) presents a unique opportunity to modernize and streamline this complex procedure. While existing research extensively evaluates the efficacy of LLMs, as annotators, this paper delves into the biases present in LLMs when annotating hate speech data. Our research contributes to understanding biases in four key categories: gender, race, religion, and disability with four LLMs: GPT-3.5, GPT-4o, Llama-3.1 and Gemma-2. Specifically targeting highly vulnerable groups within these categories, we analyze annotator biases. Furthermore, we conduct a comprehensive examination of potential factors contributing to these biases by scrutinizing the annotated data. We introduce our custom hate speech detection dataset, HateBiasNet, to conduct this research. Additionally, we perform the same experiments on the ETHOS (Mollas et al. 2022) dataset also for comparative analysis. This paper serves as a crucial resource, guiding researchers and practitioners in harnessing the potential of LLMs for data annotation, thereby fostering advancements in this critical field.

Topik & Kata Kunci

cs.CL cs.AI cs.LG

Penulis (15)

Amit Das

Zheng Zhang

Najib Hasan

Souvika Sarkar

Fatemeh Jamshidi

Tathagata Bhattacharya

Mostafa Rahgouy

Nilanjana Raychawdhary

Dongji Feng

Vinija Jain

Aman Chadha

Mary Sandage

Lauramarie Pope

Gerry Dozier

Cheryl Seals

Format Sitasi

APA MLA BibTeX

Das, A., Zhang, Z., Hasan, N., Sarkar, S., Jamshidi, F., Bhattacharya, T. et al. (2024). Investigating Annotator Bias in Large Language Models for Hate Speech Detection. https://arxiv.org/abs/2406.11109

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓