arXiv Open Access 2026

PsychEval: A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor

Qianjun Pan Junyi Wang Jie Zhou Yutao Yang Junsong Li +8 lainnya
Lihat Sumber

Abstrak

To develop a reliable AI for psychological assessment, we introduce \texttt{PsychEval}, a multi-session, multi-therapy, and highly realistic benchmark designed to address three key challenges: \textbf{1) Can we train a highly realistic AI counselor?} Realistic counseling is a longitudinal task requiring sustained memory and dynamic goal tracking. We propose a multi-session benchmark (spanning 6-10 sessions across three distinct stages) that demands critical capabilities such as memory continuity, adaptive reasoning, and longitudinal planning. The dataset is annotated with extensive professional skills, comprising over 677 meta-skills and 4577 atomic skills. \textbf{2) How to train a multi-therapy AI counselor?} While existing models often focus on a single therapy, complex cases frequently require flexible strategies among various therapies. We construct a diverse dataset covering five therapeutic modalities (Psychodynamic, Behaviorism, CBT, Humanistic Existentialist, and Postmodernist) alongside an integrative therapy with a unified three-stage clinical framework across six core psychological topics. \textbf{3) How to systematically evaluate an AI counselor?} We establish a holistic evaluation framework with 18 therapy-specific and therapy-shared metrics across Client-Level and Counselor-Level dimensions. To support this, we also construct over 2,000 diverse client profiles. Extensive experimental analysis fully validates the superior quality and clinical fidelity of our dataset. Crucially, \texttt{PsychEval} transcends static benchmarking to serve as a high-fidelity reinforcement learning environment that enables the self-evolutionary training of clinically responsible and adaptive AI counselors.

Topik & Kata Kunci

Penulis (13)

Q

Qianjun Pan

J

Junyi Wang

J

Jie Zhou

Y

Yutao Yang

J

Junsong Li

K

Kaiyin Xu

Y

Yougen Zhou

Y

Yihan Li

J

Jingyuan Zhao

Q

Qin Chen

N

Ningning Zhou

K

Kai Chen

L

Liang He

Format Sitasi

Pan, Q., Wang, J., Zhou, J., Yang, Y., Li, J., Xu, K. et al. (2026). PsychEval: A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor. https://arxiv.org/abs/2601.01802

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓