Semantic Scholar Open Access 2022 884 citations

Taxonomy of Risks posed by Language Models

Laura Weidinger, Jonathan Uesato, Maribeth Rauh, Conor Griffin, Po-Sen Huang, +18 others

Abstract

Responsible innovation on large-scale Language Models (LMs) requires foresight into and in-depth understanding of the risks these models may pose. This paper develops a comprehensive taxonomy of ethical and social risks associated with LMs. We identify twenty-one risks, drawing on expertise and literature from computer science, linguistics, and the social sciences. We situate these risks in our taxonomy of six risk areas: I. Discrimination, Hate speech and Exclusion, II. Information Hazards, III. Misinformation Harms, IV. Malicious Uses, V. Human-Computer Interaction Harms, and VI. Environmental and Socioeconomic harms. For risks that have already been observed in LMs, the causal mechanism leading to harm, evidence of the risk, and approaches to risk mitigation are discussed. We further describe and analyse risks that have not yet been observed but are anticipated based on assessments of other language technologies, and situate these in the same taxonomy. We underscore that it is the responsibility of organizations to engage with the mitigations we discuss throughout the paper. We close by highlighting challenges and directions for further research on risk evaluation and mitigation with the goal of ensuring that language models are developed responsibly.

Authors (23)

Laura Weidinger
Jonathan Uesato
Maribeth Rauh
Conor Griffin
Po-Sen Huang
John F. J. Mellor
Amelia Glaese
Myra Cheng
Borja Balle
Atoosa Kasirzadeh
Courtney Biles
S. Brown
Zachary Kenton
W. Hawkins
T. Stepleton
Abeba Birhane
Lisa Anne Hendricks
Laura Rimell
William S. Isaac
Julia Haas
Sean Legassick
G. Irving
Iason Gabriel

Citation Format

Weidinger, L., Uesato, J., Rauh, M., Griffin, C., Huang, P., Mellor, J.F.J. et al. (2022). Taxonomy of Risks posed by Language Models. https://doi.org/10.1145/3531146.3533088

Quick Access

View at Source: doi.org/10.1145/3531146.3533088

Journal Information

Publication Year: 2022
Language: English
Total Citations: 884
Database Source: Semantic Scholar
DOI: 10.1145/3531146.3533088
Access: Open Access