Semantic Scholar Open Access 2019 300 sitasi

How Data Science Workers Work with Data: Discovery, Capture, Curation, Design, Creation

Michael J. Muller Ingrid Lange Dakuo Wang David Piorkowski Jason Tsay +3 lainnya

Abstrak

With the rise of big data, there has been an increasing need for practitioners in this space and an increasing opportunity for researchers to understand their workflows and design new tools to improve it. Data science is often described as data-driven, comprising unambiguous data and proceeding through regularized steps of analysis. However, this view focuses more on abstract processes, pipelines, and workflows, and less on how data science workers engage with the data. In this paper, we build on the work of other CSCW and HCI researchers in describing the ways that scientists, scholars, engineers, and others work with their data, through analyses of interviews with 21 data science professionals. We set five approaches to data along a dimension of interventions: Data as given; as captured; as curated; as designed; and as created. Data science workers develop an intuitive sense of their data and processes, and actively shape their data. We propose new ways to apply these interventions analytically, to make sense of the complex activities around data practices.

Topik & Kata Kunci

Penulis (8)

M

Michael J. Muller

I

Ingrid Lange

D

Dakuo Wang

D

David Piorkowski

J

Jason Tsay

Q

Q. Liao

C

Casey Dugan

T

T. Erickson

Format Sitasi

Muller, M.J., Lange, I., Wang, D., Piorkowski, D., Tsay, J., Liao, Q. et al. (2019). How Data Science Workers Work with Data: Discovery, Capture, Curation, Design, Creation. https://doi.org/10.1145/3290605.3300356

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1145/3290605.3300356
Informasi Jurnal
Tahun Terbit
2019
Bahasa
en
Total Sitasi
300×
Sumber Database
Semantic Scholar
DOI
10.1145/3290605.3300356
Akses
Open Access ✓