arXiv
Open Access
2025
Sycophancy Claims about Language Models: The Missing Human-in-the-Loop
Jan Batzner
Volker Stocker
Stefan Schmid
Gjergji Kasneci
Abstrak
Sycophantic response patterns in Large Language Models (LLMs) have been increasingly claimed in the literature. We review methodological challenges in measuring LLM sycophancy and identify five core operationalizations. Despite sycophancy being inherently human-centric, current research does not evaluate human perception. Our analysis highlights the difficulties in distinguishing sycophantic responses from related concepts in AI alignment and offers actionable recommendations for future research.
Penulis (4)
J
Jan Batzner
V
Volker Stocker
S
Stefan Schmid
G
Gjergji Kasneci
Akses Cepat
Informasi Jurnal
- Tahun Terbit
- 2025
- Bahasa
- en
- Sumber Database
- arXiv
- Akses
- Open Access ✓