Semantic Scholar Open Access 2023 492 sitasi

Can Large Language Models Transform Computational Social Science?

Caleb Ziems William B. Held Omar Shaikh Jiaao Chen Zhehao Zhang +1 lainnya

Abstrak

Large language models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the computational social science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 language models on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers’ gold references. We conclude that the performance of today’s LLMs can augment the CSS research pipeline in two ways: (1) serving as zero-shot data annotators on human annotation teams, and (2) bootstrapping challenging creative generation tasks (e.g., explaining the underlying attributes of a text). In summary, LLMs are posed to meaningfully participate in social science analysis in partnership with humans.

Topik & Kata Kunci

Penulis (6)

C

Caleb Ziems

W

William B. Held

O

Omar Shaikh

J

Jiaao Chen

Z

Zhehao Zhang

D

Diyi Yang

Format Sitasi

Ziems, C., Held, W.B., Shaikh, O., Chen, J., Zhang, Z., Yang, D. (2023). Can Large Language Models Transform Computational Social Science?. https://doi.org/10.1162/coli_a_00502

Akses Cepat

Lihat di Sumber doi.org/10.1162/coli_a_00502
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Total Sitasi
492×
Sumber Database
Semantic Scholar
DOI
10.1162/coli_a_00502
Akses
Open Access ✓