Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Exploiting Retraining-Based Mixed-Precision Quantization for Low-Cost DNN Accelerator Design

Authors
Kim, NahsungShin, DongyeobChoi, WonseokKim, GeonhoPark, Jongsun
Issue Date
7월-2021
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Keywords
Deep neural network (DNN) accelerator; energy-efficient accelerator; model compression; quantization
Citation
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, v.32, no.7, pp.2925 - 2938
Indexed
SCIE
SCOPUS
Journal Title
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
Volume
32
Number
7
Start Page
2925
End Page
2938
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/127783
DOI
10.1109/TNNLS.2020.3008996
ISSN
2162-237X
Abstract
For successful deployment of deep neural networks (DNNs) on resource-constrained devices, retraining-based quantization has been widely adopted to reduce the number of DRAM accesses. By properly setting training parameters, such as batch size and learning rate, bit widths of both weights and activations can be uniformly quantized down to 4 bit while maintaining full precision accuracy. In this article, we present a retraining-based mixed-precision quantization approach and its customized DNN accelerator to achieve high energy efficiency. In the proposed quantization, in the middle of retraining, an additional bit (extra quantization level) is assigned to the weights that have shown frequent switching between two contiguous quantization levels since it means that both quantization levels cannot help to reduce quantization loss. We also mitigate the gradient noise that occurs in the retraining process by taking a lower learning rate near the quantization threshold. For the proposed novel mixed-precision quantized network (MPQ-network), we have implemented a customized accelerator using a 65-nm CMOS process. In the accelerator, the proposed processing elements (PEs) can be dynamically reconfigured to process variable bit widths from 2 to 4 bit for both weights and activations. The numerical results show that the proposed quantization can achieve 1.37x better compression ratio for VGG-9 using CIFAR-10 data set compared with a uniform 4-bit (both weights and activations) model without loss of classification accuracy. The proposed accelerator also shows 1.29x of energy savings for VGG-9 using the CIFAR-10 data set over the state-of-the-art accelerator.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Jong sun photo

Park, Jong sun
공과대학 (전기전자공학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE