Overlapping computation and communication of three-dimensional FDTD on a GPU cluster
- Authors
- Kim, Ki-Hwan; Park, Q-Han
- Issue Date
- 11월-2012
- Publisher
- ELSEVIER
- Keywords
- FDTD; GPU cluster; CUDA; OpenCL
- Citation
- COMPUTER PHYSICS COMMUNICATIONS, v.183, no.11, pp.2364 - 2369
- Indexed
- SCIE
SCOPUS
- Journal Title
- COMPUTER PHYSICS COMMUNICATIONS
- Volume
- 183
- Number
- 11
- Start Page
- 2364
- End Page
- 2369
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/107118
- DOI
- 10.1016/j.cpc.2012.06.003
- ISSN
- 0010-4655
- Abstract
- Large-scale electromagnetic field simulations using the FDTD (finite-difference time-domain) method require the use of CPU (graphics processing unit) clusters. However, the communication overhead caused by slow interconnections becomes a major performance bottleneck. In this paper, as a way to remove the bottleneck, we propose the 'kernel-split method' and the 'host-buffer method' which overlap computation and communication for the FDTD simulation on the CPU cluster. The host-buffer method in particular enables overlapping without any modifications to the update-kernels that are already in use. We also present theoretical formulas to predict the overlap threshold and the total throughput for each method. By using our overlap methods with 6 CPU nodes, we demonstrate that the total performance of 3D FDTD reaches 92% of a six-fold increase, which is the upper limit that would be reached if there were no communication overhead. (C) 2012 Elsevier B.V. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Science > Department of Physics > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.