Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

CAMDNN: Content-Aware Mapping of a Network of Deep Neural Networks on Edge MPSoCs

Full metadata record
DC Field Value Language
dc.contributor.authorHeidari, Soroush-
dc.contributor.authorGhasemi, Mehdi-
dc.contributor.authorKim, Young Geun-
dc.contributor.authorWu, Carole-Jean-
dc.contributor.authorVrudhula, Sarma-
dc.date.accessioned2022-12-08T12:42:19Z-
dc.date.available2022-12-08T12:42:19Z-
dc.date.created2022-12-08-
dc.date.issued2022-12-01-
dc.identifier.issn0018-9340-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/146479-
dc.description.abstractMachine Learning (ML) workloads are increasingly deployed at the edge. Enabling efficient inference execution while considering model and system heterogeneity remains challenging, especially for ML tasks built with a network of deep neural networks (DNNs). The challenge is to maximize the utilization of all available resources on the multiprocessor system on a chip (MPSoC) at the same time. This becomes even more complicated because the optimal mapping for the network of DNNs can vary with input batch sizes and scene complexity. In this paper, a holistic hierarchical scheduling framework is presented to optimize the execution time for a network of DNN models on an edge MPSoC at runtime, considering varying input characteristics. The framework consists of a local and a global scheduler. The local scheduler maps individual DNNs in the inference pipeline to the best-performing hardware unit while the global scheduler customizes an Integer Linear Programming (ILP) solution to instantiate DNN remapping. To minimize scheduler runtime overhead, an imitation learning (IL) based scheduler is used that approximates the ILP solutions. The proposed scheduling framework (CAMDNN) was implemented on a Qualcomm Robotic RB5 platform. CAMDNN resulted in lower execution time of up to 32% than heterogeneous earliest finish time, and by factors of 6.67X, 5.6X and 2.17X than the CPU-only, GPU-only and Central Queue schedulers.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE COMPUTER SOC-
dc.titleCAMDNN: Content-Aware Mapping of a Network of Deep Neural Networks on Edge MPSoCs-
dc.typeArticle-
dc.contributor.affiliatedAuthorKim, Young Geun-
dc.identifier.doi10.1109/TC.2022.3207137-
dc.identifier.scopusid2-s2.0-85139409241-
dc.identifier.wosid000886309300011-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON COMPUTERS, v.71, no.12, pp.3191 - 3202-
dc.relation.isPartOfIEEE TRANSACTIONS ON COMPUTERS-
dc.citation.titleIEEE TRANSACTIONS ON COMPUTERS-
dc.citation.volume71-
dc.citation.number12-
dc.citation.startPage3191-
dc.citation.endPage3202-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Hardware & Architecture-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordAuthorMachine learning-
dc.subject.keywordAuthorscheduling-
dc.subject.keywordAuthoredge-
dc.subject.keywordAuthorIoT-
dc.subject.keywordAuthordeep neural networks-
dc.subject.keywordAuthorDNN serving-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Young Geun photo

Kim, Young Geun
대학원 (컴퓨터학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE