arXiv Open Access 2024

TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty Simulations

Nathalie Maria Kirch, Konstantin Hebenstreit, Matthias Samwald

Abstract

We present the TRIAGE Benchmark, a novel machine ethics (ME) benchmark that tests LLMs' ability to make ethical decisions during mass casualty incidents. It uses real-world ethical dilemmas with clear solutions designed by medical professionals, offering a more realistic alternative to annotation-based benchmarks. TRIAGE incorporates various prompting styles to evaluate model performance across different contexts. Most models consistently outperformed random guessing, suggesting LLMs may support decision-making in triage scenarios. Neutral or factual scenario formulations led to the best performance, unlike other ME benchmarks where ethical reminders improved outcomes. Adversarial prompts reduced performance but not to random guessing levels. Open-source models made more morally serious errors, and general capability overall predicted better performance.

Topics & Keywords

cs.CY cs.AI

Citation Format

Kirch, N. M., Hebenstreit, K., & Samwald, M. (2024). TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty Simulations. arXiv. https://arxiv.org/abs/2410.18991

Journal Information

Year Published: 2024
Language: English
Source Database: arXiv
Access: Open Access