arXiv Open Access 2025

UserSimCRS v2: Simulation-Based Evaluation for Conversational Recommender Systems

Nolwenn Bernard Krisztian Balog
Lihat Sumber

Abstrak

Resources for simulation-based evaluation of conversational recommender systems (CRSs) are scarce. The UserSimCRS toolkit was introduced to address this gap. In this work, we present UserSimCRS v2, a significant upgrade aligning the toolkit with state-of-the-art research. Key extensions include an enhanced agenda-based user simulator, introduction of large language model-based simulators, integration for a wider range of CRSs and datasets, and new LLM-as-a-judge evaluation utilities. We demonstrate these extensions in a case study.

Topik & Kata Kunci

Penulis (2)

N

Nolwenn Bernard

K

Krisztian Balog

Format Sitasi

Bernard, N., Balog, K. (2025). UserSimCRS v2: Simulation-Based Evaluation for Conversational Recommender Systems. https://arxiv.org/abs/2512.04588

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓