Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Exploiting thread-level parallelism in lockstep execution by partially duplicating a single pipeline

Full metadata record
DC Field Value Language
dc.contributor.authorOh, Jaegeun-
dc.contributor.authorHwang, Seok Joong-
dc.contributor.authorNguyen, Huong Giang-
dc.contributor.authorKim, Areum-
dc.contributor.authorKim, Seon Wook-
dc.contributor.authorKim, Chulwoo-
dc.contributor.authorKim, Jong-Kook-
dc.date.accessioned2021-09-09T05:34:06Z-
dc.date.available2021-09-09T05:34:06Z-
dc.date.created2021-06-10-
dc.date.issued2008-08-
dc.identifier.issn1225-6463-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/122917-
dc.description.abstractin most parallel loops of embedded applications, every iteration executes the exact same sequence of instructions while manipulating different data. This fact motivates a new compiler-hardware orchestrated execution framework in which all parallel threads share one fetch unit and one decode unit but have their own execution, memory, and write-back units. This resource sharing enables parallel threads to execute in lockstep with minimal hardware extension and compiler support. Our proposed architecture, called multithreaded lockstep execution processor (MLEP), is a compromise between the single-instruction multiple-data (SIMD) and symmetric multithreading/chip multiprocessor (SMT/CMP) solutions. The proposed approach is more favorable than a typical SIMD execution in terms of degree of parallelism, range of applicability, and code generation, and can save more power and chip area than the SMT/CMP approach without significant performance degradation. For the architecture verification, we extend a commercial 32-bit embedded core AE32000C and synthesize it on Xilinx FPGA. Compared to the original architecture, our approach is 13.5% faster with a 2-way MLEP and 33.7% faster with a 4-way MLEP in EEMBC benchmarks which are automatically parallelized by the Intel compiler.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherWILEY-
dc.titleExploiting thread-level parallelism in lockstep execution by partially duplicating a single pipeline-
dc.typeArticle-
dc.contributor.affiliatedAuthorKim, Seon Wook-
dc.contributor.affiliatedAuthorKim, Chulwoo-
dc.contributor.affiliatedAuthorKim, Jong-Kook-
dc.identifier.doi10.4218/etrij.08.0107.0343-
dc.identifier.scopusid2-s2.0-49449105358-
dc.identifier.wosid000258418400009-
dc.identifier.bibliographicCitationETRI JOURNAL, v.30, no.4, pp.576 - 586-
dc.relation.isPartOfETRI JOURNAL-
dc.citation.titleETRI JOURNAL-
dc.citation.volume30-
dc.citation.number4-
dc.citation.startPage576-
dc.citation.endPage586-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.identifier.kciidART001269287-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.description.journalRegisteredClasskci-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordAuthorILP-
dc.subject.keywordAuthorTLP-
dc.subject.keywordAuthorSMT-
dc.subject.keywordAuthorCMP-
dc.subject.keywordAuthorMLEP-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Jong Kook photo

Kim, Jong Kook
공과대학 (전기전자공학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE