A novel warp scheduling scheme considering long-latency operations for high-performance GPUs

Authors
Cong Thuan Do; Choi, Hong Jun; Chung, Sung Woo; Kim, Cheol Hong
Issue Date
Apr-2020
Publisher
SPRINGER
Keywords
GPGPU; Performance; Memory latency; Utilization; Warp scheduling
Citation
JOURNAL OF SUPERCOMPUTING, v.76, no.4, pp.3043 - 3062
Indexed
SCIE
SCOPUS
Journal Title
JOURNAL OF SUPERCOMPUTING
Volume
76
Number
4
Start Page
3043
End Page
3062
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/56829
DOI
10.1007/s11227-019-03091-2
ISSN
0920-8542
Abstract
Graphics processing units (GPUs) have become one of the best platforms for exploiting the plentiful thread-level parallelism of applications. However, GPUs continue to underutilize their hardware resources for optimizing the performance of numerous general-purpose applications. One primary reason for this is the inefficiency of existing warp schedulers in hiding long-latency operations such as global loads and stores. This study proposes a long-latency operation-based warp scheduler to improve GPU performance. In the proposed warp scheduler, warps are partitioned into different pools based on the characteristics of instructions that are subsequently executed. Specifically, this warp scheduler uses warps that are likely waiting for long-latency operations for a guiding role. Meanwhile, other warps perform filling roles (i.e., to overlap the latencies caused by the guiding warps). Our experimental results demonstrate that the proposed warp scheduler improves GPU performance by 24.4% on average as compared to the conventional warp scheduler.
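The pool partitioning described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract only states that warps are grouped by the instructions they execute next, that warps likely to wait on long-latency operations take a guiding role, and that the other warps take a filling role. Everything else here is an assumption made for illustration: the opcode names, the `Warp` class, the `partition` and `select_warp` functions, and in particular the policy of issuing guiding warps before filling warps.

```python
from collections import deque
from dataclasses import dataclass

# Assumption: opcodes treated as long-latency. The abstract names global
# loads and stores as examples; the opcode strings are hypothetical.
LONG_LATENCY_OPS = {"ld.global", "st.global"}


@dataclass
class Warp:
    wid: int
    pending: deque  # opcodes not yet issued, in program order


def partition(warps):
    """Split warps into (guiding, filling) pools by their next instruction."""
    guiding, filling = [], []
    for warp in warps:
        if not warp.pending:
            continue  # finished warp, nothing left to schedule
        if warp.pending[0] in LONG_LATENCY_OPS:
            guiding.append(warp)  # about to start a long-latency operation
        else:
            filling.append(warp)  # can overlap latency with short operations
    return guiding, filling


def select_warp(warps):
    """Hypothetical policy: prefer a guiding warp, else fall back to filling."""
    guiding, filling = partition(warps)
    pool = guiding if guiding else filling
    return pool[0] if pool else None


warps = [
    Warp(0, deque(["add", "ld.global"])),
    Warp(1, deque(["ld.global", "mul"])),
    Warp(2, deque(["mul", "add"])),
]
guiding, filling = partition(warps)
print([w.wid for w in guiding], [w.wid for w in filling])
```

In this sketch, warp 1 lands in the guiding pool because its next instruction is a global load, while warps 0 and 2 land in the filling pool. A real scheduler would also track which warps are stalled on outstanding memory requests, which this sketch omits.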
Appears in Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
