Analyzing GCN Aggregation on GPU

Kim, Inje; Jeong, Jonghyun; Oh, Yunho; Yoon, Myung Kuk; Koo, Gunjae

doi:10.1109/ACCESS.2022.3217222

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Analyzing GCN Aggregation on GPUopen access

Authors: Kim, Inje; Jeong, Jonghyun; Oh, Yunho; Yoon, Myung Kuk; Koo, Gunjae

Issue Date: 2022

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords: Graphics processing units; Kernel; Convolutional neural networks; Neural networks; Mathematical models; Hardware; Data models; Graph neural networks; GCN; aggregation kernel; GPU; characteristics

Citation: IEEE ACCESS, v.10, pp.113046 - 113060

Indexed: SCIE
SCOPUS

Journal Title: IEEE ACCESS

Volume: 10

Start Page: 113046

End Page: 113060

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/145592

DOI: 10.1109/ACCESS.2022.3217222

ISSN: 2169-3536

Abstract: Graph convolutional neural networks (GCNs) are emerging neural networks for graph structures that include large features associated with each vertex. The operations of GCN can be divided into two phases - aggregation and combination. While the combination just performs matrix multiplications using trained weights and aggregated features, the aggregation phase requires graph traversal to collect features from adjacent vertices. Even though neural network applications rely on GPU's massively parallel processing, GCN aggregation kernels exhibit rather low performance since graph processing using compressed graph structures provokes frequent irregular accesses in GPUs. In order to investigate the performance hurdles of GCN aggregation on GPU, we perform an in-depth analysis of the aggregation kernels using real GPU hardware and a cycle-accurate GPU simulator. We first analyze the characteristics of the popular graph datasets used for GCN studies. We reveal the fractions of non-zero elements in feature vectors are diverse among datasets. Based on the observation, we build two types of aggregation kernels that handle uncompressed and compressed feature vectors. Our evaluation exhibits the performance of aggregation can be significantly influenced by kernel design approaches and feature density. We also analyze the individual loads that access the data arrays of the aggregation kernels to specify critical loads. Our analysis reveals the performance of GPU memory hierarchy is influenced by access patterns and feature size of graph datasets. Based on our observations we discuss possible kernel design approaches and architectural ideas that can improve the performance of GCN aggregation.

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,488,356; Today View :64,658

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE