Detailed Information

CAMDNN: Content-Aware Mapping of a Network of Deep Neural Networks on Edge MPSoCs

Authors
Heidari, Soroush; Ghasemi, Mehdi; Kim, Young Geun; Wu, Carole-Jean; Vrudhula, Sarma
Issue Date
1 December 2022
Publisher
IEEE COMPUTER SOC
Keywords
Machine learning; scheduling; edge; IoT; deep neural networks; DNN serving
Citation
IEEE TRANSACTIONS ON COMPUTERS, v.71, no.12, pp.3191 - 3202
Indexed
SCIE
SCOPUS
Journal Title
IEEE TRANSACTIONS ON COMPUTERS
Volume
71
Number
12
Start Page
3191
End Page
3202
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/146479
DOI
10.1109/TC.2022.3207137
ISSN
0018-9340
Abstract
Machine Learning (ML) workloads are increasingly deployed at the edge. Enabling efficient inference execution while considering model and system heterogeneity remains challenging, especially for ML tasks built with a network of deep neural networks (DNNs). The challenge is to maximize the simultaneous utilization of all available resources on the multiprocessor system on a chip (MPSoC). This becomes even more complicated because the optimal mapping for the network of DNNs can vary with input batch size and scene complexity. In this paper, a holistic hierarchical scheduling framework is presented to optimize the execution time for a network of DNN models on an edge MPSoC at runtime, considering varying input characteristics. The framework consists of a local and a global scheduler. The local scheduler maps individual DNNs in the inference pipeline to the best-performing hardware unit, while the global scheduler customizes an Integer Linear Programming (ILP) solution to instantiate DNN remapping. To minimize scheduler runtime overhead, an imitation learning (IL) based scheduler is used that approximates the ILP solutions. The proposed scheduling framework (CAMDNN) was implemented on a Qualcomm Robotic RB5 platform. CAMDNN reduced execution time by up to 32% compared with heterogeneous earliest finish time, and by factors of 6.67X, 5.6X and 2.17X compared with the CPU-only, GPU-only and Central Queue schedulers, respectively.
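The two-level idea in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the latency table, the unit names, and the function names are all hypothetical, the DNNs are assumed to be independent (no pipeline dependencies), and an exhaustive search stands in for the ILP formulation, which the abstract does not spell out.

```python
from itertools import product

# Hypothetical profiled latencies in milliseconds: LATENCY_MS[dnn][unit].
# Illustrative numbers only, not measurements from the paper.
LATENCY_MS = {
    "detector":   {"cpu": 40.0, "gpu": 12.0, "dsp": 15.0},
    "classifier": {"cpu": 20.0, "gpu": 8.0,  "dsp": 9.0},
    "tracker":    {"cpu": 10.0, "gpu": 6.0,  "dsp": 7.0},
}


def local_mapping(latency_ms):
    """Map each DNN, in isolation, to the unit with its lowest latency."""
    return {dnn: min(units, key=units.get) for dnn, units in latency_ms.items()}


def makespan(mapping, latency_ms):
    """Finish time when DNNs sharing a unit run serially and units run in parallel.

    Assumption: the DNNs are independent of one another.
    """
    load = {}
    for dnn, unit in mapping.items():
        load[unit] = load.get(unit, 0.0) + latency_ms[dnn][unit]
    return max(load.values())


def global_mapping(latency_ms):
    """Try every DNN-to-unit assignment and keep the smallest makespan.

    Exhaustive search is a stand-in for an ILP solver; it is only
    practical for a handful of DNNs and units.
    """
    dnns = list(latency_ms)
    units = sorted({u for per_dnn in latency_ms.values() for u in per_dnn})
    best = None
    for choice in product(units, repeat=len(dnns)):
        mapping = dict(zip(dnns, choice))
        cost = makespan(mapping, latency_ms)
        if best is None or cost < best[1]:
            best = (mapping, cost)
    return best


if __name__ == "__main__":
    greedy = local_mapping(LATENCY_MS)
    print("local :", greedy, makespan(greedy, LATENCY_MS))
    print("global:", *global_mapping(LATENCY_MS))
```

With these made-up numbers, the per-DNN choice places every model on the GPU, while the joint search spreads the models across units and shortens the overall finish time. That gap is the kind of contention a global remapping step is meant to address.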
Appears in Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
