A novel warp scheduling scheme considering long-latency operations for high-performance GPUs

Cong Thuan Do; Choi, Hong Jun; Chung, Sung Woo; Kim, Cheol Hong

doi:10.1007/s11227-019-03091-2

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

A novel warp scheduling scheme considering long-latency operations for high-performance GPUs

Authors: Cong Thuan Do; Choi, Hong Jun; Chung, Sung Woo; Kim, Cheol Hong

Issue Date: 4월-2020

Publisher: SPRINGER

Keywords: GPGPU; Performance; Memory latency; Utilization; Warp scheduling

Citation: JOURNAL OF SUPERCOMPUTING, v.76, no.4, pp.3043 - 3062

Indexed: SCIE
SCOPUS

Journal Title: JOURNAL OF SUPERCOMPUTING

Volume: 76

Number: 4

Start Page: 3043

End Page: 3062

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/56829

DOI: 10.1007/s11227-019-03091-2

ISSN: 0920-8542

Abstract: Graphics processing units (GPUs) have become one of the best platforms for exploiting the plentiful thread-level parallelism of applications. However, GPUs continue to underutilize their hardware resources for optimizing the performance of numerous general-purpose applications. One primary reason for this is the inefficiency of existing warp schedulers in hiding long-latency operations such as global loads and stores. This study proposes a long-latency operation-based warp scheduler to improve GPU performance. In the proposed warp scheduler, warps are partitioned into different pools based on the characteristics of instructions that are subsequently executed. Specifically, this warp scheduler uses warps that are likely waiting for long-latency operations for a guiding role. Meanwhile, other warps perform filling roles (i.e., to overlap the latencies caused by the guiding warps). Our experimental results demonstrate that the proposed warp scheduler improves GPU performance by 24.4% on average as compared to the conventional warp scheduler.

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,866,468; Today View :28,781

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE