arXiv Open Access 2026

Responsible Evaluation of AI for Mental Health

Hiba Arnaout Anmol Goel H. Andrew Schwartz Steffen T. Eberhardt Dana Atzil-Slonim +11 lainnya

Lihat Sumber

Abstrak

Although artificial intelligence (AI) shows growing promise for mental health care, current approaches to evaluating AI tools in this domain remain fragmented and poorly aligned with clinical practice, social context, and first-hand user experience. This paper argues for a rethinking of responsible evaluation -- what is measured, by whom, and for what purpose -- by introducing an interdisciplinary framework that integrates clinical soundness, social context, and equity, providing a structured basis for evaluation. Through an analysis of 135 recent *CL publications, we identify recurring limitations, including over-reliance on generic metrics that do not capture clinical validity, therapeutic appropriateness, or user experience, limited participation from mental health professionals, and insufficient attention to safety and equity. To address these gaps, we propose a taxonomy of AI mental health support types -- assessment-, intervention-, and information synthesis-oriented -- each with distinct risks and evaluative requirements, and illustrate its use through case studies.

Topik & Kata Kunci

cs.CY cs.AI

Penulis (16)

Hiba Arnaout

Anmol Goel

H. Andrew Schwartz

Steffen T. Eberhardt

Dana Atzil-Slonim

Gavin Doherty

Brian Schwartz

Wolfgang Lutz

Tim Althoff

Munmun De Choudhury

Hamidreza Jamalabadi

Raj Sanjay Shah

Flor Miriam Plaza-del-Arco

Dirk Hovy

Maria Liakata

Iryna Gurevych

Format Sitasi

APA MLA BibTeX

Arnaout, H., Goel, A., Schwartz, H.A., Eberhardt, S.T., Atzil-Slonim, D., Doherty, G. et al. (2026). Responsible Evaluation of AI for Mental Health. https://arxiv.org/abs/2602.00065

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2026
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓