arXiv Open Access 2024

Is ETHICS about ethics? Evaluating the ETHICS benchmark

Leif Hancox-Li Borhane Blili-Hamelin

Lihat Sumber

Abstrak

ETHICS is probably the most-cited dataset for testing the ethical capabilities of language models. Drawing on moral theory, psychology, and prompt evaluation, we interrogate the validity of the ETHICS benchmark. Adding to prior work, our findings suggest that having a clear understanding of ethics and how it relates to empirical phenomena is key to the validity of ethics evaluations for AI.

Topik & Kata Kunci

cs.CY

Penulis (2)

Leif Hancox-Li

Borhane Blili-Hamelin

Format Sitasi

APA MLA BibTeX

Hancox-Li, L., Blili-Hamelin, B. (2024). Is ETHICS about ethics? Evaluating the ETHICS benchmark. https://arxiv.org/abs/2410.13009

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓