Semantic Scholar Open Access 2019 52 sitasi

Genomics and data science: an application within an umbrella

Fábio C. P. Navarro Hussein Mohsen Chengfei Yan Shantao Li Mengting Gu +2 lainnya

Abstrak

Data science allows the extraction of practical insights from large-scale data. Here, we contextualize it as an umbrella term, encompassing several disparate subdomains. We focus on how genomics fits as a specific application subdomain, in terms of well-known 3 V data and 4 M process frameworks (volume-velocity-variety and measurement-mining-modeling-manipulation, respectively). We further analyze the technical and cultural “exports” and “imports” between genomics and other data-science subdomains (e.g., astronomy). Finally, we discuss how data value, privacy, and ownership are pressing issues for data science applications, in general, and are especially relevant to genomics, due to the persistent nature of DNA.

Topik & Kata Kunci

Penulis (7)

F

Fábio C. P. Navarro

H

Hussein Mohsen

C

Chengfei Yan

S

Shantao Li

M

Mengting Gu

W

W. Meyerson

M

M. Gerstein

Format Sitasi

Navarro, F.C.P., Mohsen, H., Yan, C., Li, S., Gu, M., Meyerson, W. et al. (2019). Genomics and data science: an application within an umbrella. https://doi.org/10.1186/s13059-019-1724-1

Akses Cepat

Lihat di Sumber doi.org/10.1186/s13059-019-1724-1
Informasi Jurnal
Tahun Terbit
2019
Bahasa
en
Total Sitasi
52×
Sumber Database
Semantic Scholar
DOI
10.1186/s13059-019-1724-1
Akses
Open Access ✓