arXiv Open Access 2025

Predicting Effects, Missing Distributions: Evaluating LLMs as Human Behavior Simulators in Operations Management

Runze Zhang Xiaowei Zhang Mingyang Zhao

Lihat Sumber

Abstrak

LLMs are emerging tools for simulating human behavior in business, economics, and social science, offering a lower-cost complement to laboratory experiments, field studies, and surveys. This paper evaluates how well LLMs replicate human behavior in operations management. Using nine published experiments in behavioral operations, we assess two criteria: replication of hypothesis-test outcomes and distributional alignment via Wasserstein distance. LLMs reproduce most hypothesis-level effects, capturing key decision biases, but their response distributions diverge from human data, including for strong commercial models. We also test two lightweight interventions -- chain-of-thought prompting and hyperparameter tuning -- which reduce misalignment and can sometimes let smaller or open-source models match or surpass larger systems.

Topik & Kata Kunci

cs.LG cs.AI

Penulis (3)

Runze Zhang

Xiaowei Zhang

Mingyang Zhao

Format Sitasi

APA MLA BibTeX

Zhang, R., Zhang, X., Zhao, M. (2025). Predicting Effects, Missing Distributions: Evaluating LLMs as Human Behavior Simulators in Operations Management. https://arxiv.org/abs/2510.03310

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓