MG-FIM: A MULTI-GPU FAST ITERATIVE METHOD USING ADAPTIVE DOMAIN DECOMPOSITION
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hong, Sumin | - |
dc.contributor.author | Jang, Ganghee | - |
dc.contributor.author | Jeong, Won-Ki | - |
dc.date.accessioned | 2022-04-13T02:42:04Z | - |
dc.date.available | 2022-04-13T02:42:04Z | - |
dc.date.created | 2022-04-12 | - |
dc.date.issued | 2022 | - |
dc.identifier.issn | 1064-8275 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/140193 | - |
dc.description.abstract | Applying the latest parallel computing technology has become a recent trend in Eikonal equation solvers. Many recent studies have focused on parallelization of Eikonal solvers for multithreaded CPUs or single GPU systems. However, multi-GPU Eikonal solvers are largely underresearched owing to their complexity in terms of data and task management. In this paper, we propose a novel adaptive domain decomposition method to realize an efficient implementation of the block-based fast iterative method on multiple GPUs. The proposed method progressively expands the computational domain assigned to each GPU to maximize load balancing and employs a locality-aware clustering algorithm to minimize inter-GPU communication overhead. We also propose various low-and high-level optimization techniques for GPU computing, such as overlapping CPU and GPU computation and inter-GPU data transfer using multiple CUDA streams. Thus, we effectively circumvent performance issues in the na{\i}\"ve parallelization using a regular decomposition method. The proposed method scales up to 6.6\times for eight GPUs. We demonstrate that our efficient parallel implementation of the proposed method achieves an improvement in runtime performance under various experimental setups. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | SIAM PUBLICATIONS | - |
dc.subject | FAST SWEEPING METHOD | - |
dc.subject | FAST MARCHING METHOD | - |
dc.subject | EIKONAL EQUATION | - |
dc.subject | SHAPE | - |
dc.subject | IMPLEMENTATIONS | - |
dc.subject | ALGORITHMS | - |
dc.title | MG-FIM: A MULTI-GPU FAST ITERATIVE METHOD USING ADAPTIVE DOMAIN DECOMPOSITION | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Jeong, Won-Ki | - |
dc.identifier.doi | 10.1137/21M1414644 | - |
dc.identifier.scopusid | 2-s2.0-85129474171 | - |
dc.identifier.wosid | 000773561400003 | - |
dc.identifier.bibliographicCitation | SIAM JOURNAL ON SCIENTIFIC COMPUTING, v.44, no.1, pp.C54 - C76 | - |
dc.relation.isPartOf | SIAM JOURNAL ON SCIENTIFIC COMPUTING | - |
dc.citation.title | SIAM JOURNAL ON SCIENTIFIC COMPUTING | - |
dc.citation.volume | 44 | - |
dc.citation.number | 1 | - |
dc.citation.startPage | C54 | - |
dc.citation.endPage | C76 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Mathematics | - |
dc.relation.journalWebOfScienceCategory | Mathematics, Applied | - |
dc.subject.keywordPlus | FAST SWEEPING METHOD | - |
dc.subject.keywordPlus | FAST MARCHING METHOD | - |
dc.subject.keywordPlus | EIKONAL EQUATION | - |
dc.subject.keywordPlus | SHAPE | - |
dc.subject.keywordPlus | IMPLEMENTATIONS | - |
dc.subject.keywordPlus | ALGORITHMS | - |
dc.subject.keywordAuthor | Eikonal equation | - |
dc.subject.keywordAuthor | viscosity solution | - |
dc.subject.keywordAuthor | GPU | - |
dc.subject.keywordAuthor | parallel computing | - |
dc.subject.keywordAuthor | job scheduling | - |
dc.subject.keywordAuthor | load balancing | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.