Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Quant-PIM: An Energy-Efficient Processing-in-Memory Accelerator for Layerwise Quantized Neural Networks

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Young Seo-
dc.contributor.authorChung, Eui-Young-
dc.contributor.authorGong, Young-Ho-
dc.contributor.authorChung, Sung Woo-
dc.date.accessioned2022-02-13T12:40:33Z-
dc.date.available2022-02-13T12:40:33Z-
dc.date.created2022-01-20-
dc.date.issued2021-12-
dc.identifier.issn1943-0663-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/135621-
dc.description.abstractLayerwise quantized neural networks (QNNs), which adopt different precisions for weights or activations in a layerwise manner, have emerged as a promising approach for embedded systems. The layerwise QNNs deploy only required number of data bits for the computation (e.g., convolution of weights and activations), which in turn reduces computation energy compared to the conventional QNNs. However, the layerwise QNNs still cause a large amount of energy in the conventional memory systems, since memory accesses are not optimized for the required precision of each layer. To address this problem, we propose Quant-PIM, an energy-efficient processing-in-memory (PIM) accelerator for layerwise QNNs. Quant-PIM selectively reads only required data bits within a data word depending on the precision, by deploying the modified I/O gating logics in a 3-D stacked memory. Thus, Quant-PIM significantly reduces energy consumption for memory accesses. In addition, Quant-PIM improves the performance of layerwise QNNs. When the required precision is half of the weight (or activation) size or less, Quant-PIM reads two data blocks in a single read operation by exploiting the saved memory bandwidth from the selective memory access, thus providing higher compute-throughput. Our simulation results show that Quant-PIM reduces system energy by 39.1%similar to 50.4% compared to the PIM system with 16-bit quantized precision, without accuracy loss.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleQuant-PIM: An Energy-Efficient Processing-in-Memory Accelerator for Layerwise Quantized Neural Networks-
dc.typeArticle-
dc.contributor.affiliatedAuthorChung, Sung Woo-
dc.identifier.doi10.1109/LES.2021.3050253-
dc.identifier.scopusid2-s2.0-85099546052-
dc.identifier.wosid000721999200007-
dc.identifier.bibliographicCitationIEEE EMBEDDED SYSTEMS LETTERS, v.13, no.4, pp.162 - 165-
dc.relation.isPartOfIEEE EMBEDDED SYSTEMS LETTERS-
dc.citation.titleIEEE EMBEDDED SYSTEMS LETTERS-
dc.citation.volume13-
dc.citation.number4-
dc.citation.startPage162-
dc.citation.endPage165-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Hardware & Architecture-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordAuthorAccelerator-
dc.subject.keywordAuthorenergy efficiency-
dc.subject.keywordAuthorlayerwise quantization-
dc.subject.keywordAuthorprocessing-in-memory (PIM)-
dc.subject.keywordAuthorquantized neural network (QNN)-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE