Sensitivity-Based Error Resilient Techniques With Heterogeneous Multiply-Accumulate Unit for Voltage Scalable Deep Neural Network Accelerators

Shin, Dongyeob; Choi, Wonseok; Park, Jongsun; Ghosh, Swaroop

doi:10.1109/JETCAS.2019.2933862

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Sensitivity-Based Error Resilient Techniques With Heterogeneous Multiply-Accumulate Unit for Voltage Scalable Deep Neural Network Accelerators

Authors: Shin, Dongyeob; Choi, Wonseok; Park, Jongsun; Ghosh, Swaroop

Issue Date: Sep-2019

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Deep neural network resilience; timing error resilient accelerator; voltage scaling

Citation: IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, v.9, no.3, pp.520 - 531

Indexed: SCIE
SCOPUS

Journal Title: IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS

Volume: 9

Number: 3

Start Page: 520

End Page: 531

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/63051

DOI: 10.1109/JETCAS.2019.2933862

ISSN: 2156-3357

Abstract: With inherent algorithmic error resilience of deep neural networks (DNNs), supply voltage scaling could be a promising technique for energy efficient DNN accelerator design. In this paper, we present an error resilient technique to enable aggressive voltage scaling by exploiting the asymmetric error resilience (sensitivity) with respect to DNN layers, filters, and channels. First-order Taylor expansion is used to evaluate the filter/channel-level weight sensitivities of large scale DNNs which accurately approximates weight sensitivities from actual error injection simulations. We also present the heterogeneous multiply-accumulate (MAC) unit based design approach where some of the MAC units are designed larger with shorter critical path delays for robustness to aggressive voltage scaling while other MAC units are designed relatively smaller. The sensitivity variations among filter weights can be leveraged to design DNN accelerator such that the computations with more sensitive weights are assigned to more robust (larger) MAC units while the computations with less sensitive weights are assigned to less robust (smaller) MAC units. Using dynamic programming, the sizes of MAC units are selected to achieve best DNN accuracy under ISO area constraint. As a result, the proposed voltage scalable DNN accelerator can achieve 34% energy savings in post layout simulations using 65 nm CMOS process with ImageNet dataset using ResNet-18 compared to state-of-the-art timing error recovery technique.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Engineering > School of Electrical Engineering > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Park, Jong sun photo

Park, Jong sun: College of Engineering (School of Electrical Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :6,997,638; Today View :9,591

RSS_1.0 RSS_2.0 ATOM_1.0

145 Anam-ro, Seongbuk-gu, Seoul, 02841, Korea+82-2-3290-2963

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE