Sequential random k-nearest neighbor feature selection for high-dimensional data
- Authors
- Park, Chan Hee; Kim, Seoung Bum
- Issue Date
- 1-4월-2015
- Publisher
- PERGAMON-ELSEVIER SCIENCE LTD
- Keywords
- Feature selection; High dimensionality; Ensemble; Wrapper; Random forest; k-NN
- Citation
- EXPERT SYSTEMS WITH APPLICATIONS, v.42, no.5, pp.2336 - 2342
- Indexed
- SCIE
SCOPUS
- Journal Title
- EXPERT SYSTEMS WITH APPLICATIONS
- Volume
- 42
- Number
- 5
- Start Page
- 2336
- End Page
- 2342
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/93878
- DOI
- 10.1016/j.eswa.2014.10.044
- ISSN
- 0957-4174
- Abstract
- Feature selection based on an ensemble classifier has been recognized as a crucial technique for modeling high-dimensional data. Feature selection based on the random forests model, which is constructed by aggregating multiple decision tree classifiers, has been widely used. However, a lack of stability and balance in decision trees decreases the robustness of random forests. This limitation motivated us to propose a feature selection method based on newly designed nearest-neighbor ensemble classifiers. The proposed method finds significant features by using an iterative procedure. We performed experiments with 20 datasets of microarray gene expressions to examine the property of the proposed method and compared it with random forests. The results demonstrated the effectiveness and robustness of the proposed method, especially when the number of features exceeds the number of observations. (C) 2014 Elsevier Ltd. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.