Semantic Scholar Open Access 2020 484 sitasi

SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training

Eric Qin A. Samajdar Hyoukjun Kwon V. Nadella S. Srinivasan +3 lainnya

Abstrak

The advent of Deep Learning (DL) has radically transformed the computing industry across the entire spectrum from algorithms to circuits. As myriad application domains embrace DL, it has become synonymous with a genre of workloads across vision, speech, language, recommendations, robotics, and games. The key compute kernel within most DL workloads is general matrix-matrix multiplications (GEMMs), which appears frequently during both the forward pass (inference and training) and backward pass (training). GEMMs are a natural choice for hardware acceleration to speed up training, and have led to 2D systolic architectures like NVIDIA tensor cores and Google Tensor Processing Unit (TPU). Unfortunately, emerging GEMMs in DL are highly irregular and sparse, which lead to poor data mappings on systolic architectures. This paper proposes SIGMA, a flexible and scalable architecture that offers high utilization of all its processing elements (PEs) regardless of kernel shape and sparsity. Within SIGMA includes a novel reduction tree microarchitecture named Forwarding Adder Network (FAN). SIGMA performs 5.7x better than systolic array architectures for irregular sparse matrices, and roughly 3x better than state-of-the-art sparse accelerators. We demonstrate an instance of SIGMA operating at 10.8 TFLOPS efficiency across arbitrary levels of sparsity, with a 65.10 mm^2 and 22.33 W footprint on a 28 nm process.

Topik & Kata Kunci

Penulis (8)

E

Eric Qin

A

A. Samajdar

H

Hyoukjun Kwon

V

V. Nadella

S

S. Srinivasan

D

Dipankar Das

B

Bharat Kaul

T

T. Krishna

Format Sitasi

Qin, E., Samajdar, A., Kwon, H., Nadella, V., Srinivasan, S., Das, D. et al. (2020). SIGMA: A Sparse and Irregular GEMM Accelerator with Flexible Interconnects for DNN Training. https://doi.org/10.1109/HPCA47549.2020.00015

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1109/HPCA47549.2020.00015
Informasi Jurnal
Tahun Terbit
2020
Bahasa
en
Total Sitasi
484×
Sumber Database
Semantic Scholar
DOI
10.1109/HPCA47549.2020.00015
Akses
Open Access ✓