Performance impact of JobTracker failure in Hadoop
- Authors
- Kim, Young-Pil; Hong, Cheol-Ho; Yoo, Chuck
- Issue Date
- 10-5월-2015
- Publisher
- WILEY
- Keywords
- Hadoop; large-scale data processing; failure analysis; JobTracker
- Citation
- INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, v.28, no.7, pp.1265 - 1281
- Indexed
- SCIE
SCOPUS
- Journal Title
- INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS
- Volume
- 28
- Number
- 7
- Start Page
- 1265
- End Page
- 1281
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/93568
- DOI
- 10.1002/dac.2759
- ISSN
- 1074-5351
- Abstract
- In this paper, we analyze the performance impact of JobTracker failure in Hadoop. A JobTracker failure is a serious problem that affects the overall job processing performance. We describe the cause of failure and the system behaviors because of failed job processing in the Hadoop. On the basis of the analysis, we build a job completion time model that reflects failure effects. Our model is based on a stochastic process with a node crash probability. With our model, we run simulation of performance impact with very credible failure data available from USENIX called computer failure data repository that have been collected for past 9years. The results show that the performance impact is very severe in that the job completion time increases about four times typically, and in a worst case, it increases up to 68 times. Copyright (c) 2014 John Wiley & Sons, Ltd.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.