Semantic Scholar Open Access 2019 322 sitasi

The Architectural Implications of Facebook's DNN-Based Personalized Recommendation

Udit Gupta Xiaodong Wang M. Naumov Carole-Jean Wu Brandon Reagen +10 lainnya

Abstrak

The widespread application of deep learning has changed the landscape of computation in data centers. In particular, personalized recommendation for content ranking is now largely accomplished using deep neural networks. However, despite their importance and the amount of compute cycles they consume, relatively little research attention has been devoted to recommendation systems. To facilitate research and advance the understanding of these workloads, this paper presents a set of real-world, production-scale DNNs for personalized recommendation coupled with relevant performance metrics for evaluation. In addition to releasing a set of open-source workloads, we conduct in-depth analysis that underpins future system design and optimization for at-scale recommendation: Inference latency varies by 60% across three Intel server generations, batching and co-location of inference jobs can drastically improve latency-bounded throughput, and diversity across recommendation models leads to different optimization strategies.

Topik & Kata Kunci

Penulis (15)

U

Udit Gupta

X

Xiaodong Wang

M

M. Naumov

C

Carole-Jean Wu

B

Brandon Reagen

D

D. Brooks

B

Bradford Cottel

K

K. Hazelwood

B

Bill Jia

H

Hsien-Hsin S. Lee

A

A. Malevich

D

Dheevatsa Mudigere

M

M. Smelyanskiy

L

Liang Xiong

X

Xuan Zhang

Format Sitasi

Gupta, U., Wang, X., Naumov, M., Wu, C., Reagen, B., Brooks, D. et al. (2019). The Architectural Implications of Facebook's DNN-Based Personalized Recommendation. https://doi.org/10.1109/HPCA47549.2020.00047

Akses Cepat

Informasi Jurnal
Tahun Terbit
2019
Bahasa
en
Total Sitasi
322×
Sumber Database
Semantic Scholar
DOI
10.1109/HPCA47549.2020.00047
Akses
Open Access ✓