Semantic Scholar Open Access 2021 116 sitasi

Deep Reinforcement Learning Based Resource Management for DNN Inference in Industrial IoT

Weiting Zhang Dong Yang Hai-xia Peng Wen Wu Wei Quan +2 lainnya

Abstrak

Performing deep neural network (DNN) inference in real time requires excessive network resources, which poses a big challenge to the resource-limited industrial Internet of things (IIoT) networks. To address the challenge, in this paper, we introduce an end-edge-cloud orchestration architecture, in which the inference task assignment and DNN model placement are flexibly coordinated. Specifically, the DNN models, trained and pre-stored in the cloud, are properly placed at the end and edge to perform DNN inference. To achieve efficient DNN inference, a multi-dimensional resource management problem is formulated to maximize the average inference accuracy while satisfying the strict delay requirements of inference tasks. Due to the mix-integer decision variables, it is difficult to solve the formulated problem directly. Thus, we transform the formulated problem into a Markov decision process which can be solved efficiently. Furthermore, a deep reinforcement learning based resource management scheme is proposed to make real-time optimal resource allocation decisions. Simulation results are provided to demonstrate that the proposed scheme can efficiently allocate the available spectrum, caching, and computing resources, and improve average inference accuracy by 31.4$\%$ compared with the deep deterministic policy gradient benchmark.

Topik & Kata Kunci

Penulis (7)

W

Weiting Zhang

D

Dong Yang

H

Hai-xia Peng

W

Wen Wu

W

Wei Quan

H

Hongke Zhang

X

X. Shen

Format Sitasi

Zhang, W., Yang, D., Peng, H., Wu, W., Quan, W., Zhang, H. et al. (2021). Deep Reinforcement Learning Based Resource Management for DNN Inference in Industrial IoT. https://doi.org/10.1109/TVT.2021.3068255

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1109/TVT.2021.3068255
Informasi Jurnal
Tahun Terbit
2021
Bahasa
en
Total Sitasi
116×
Sumber Database
Semantic Scholar
DOI
10.1109/TVT.2021.3068255
Akses
Open Access ✓