Stability approach to selecting the number of principal components
- Authors
- Song, Jiyeon; Shin, Seung Jun
- Issue Date
- 12월-2018
- Publisher
- SPRINGER HEIDELBERG
- Keywords
- Principal component analysis; Stability selection; Structural dimension; Subsampling
- Citation
- COMPUTATIONAL STATISTICS, v.33, no.4, pp.1923 - 1938
- Indexed
- SCIE
SCOPUS
- Journal Title
- COMPUTATIONAL STATISTICS
- Volume
- 33
- Number
- 4
- Start Page
- 1923
- End Page
- 1938
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/71392
- DOI
- 10.1007/s00180-018-0826-7
- ISSN
- 0943-4062
- Abstract
- Principal component analysis (PCA) is a canonical tool that reduces data dimensionality by finding linear transformations that project the data into a lower dimensional subspace while preserving the variability of the data. Selecting the number of principal components (PC) is essential but challenging for PCA since it represents an unsupervised learning problem without a clear target label at the sample level. In this article, we propose a new method to determine the optimal number of PCs based on the stability of the space spanned by PCs. A series of analyses with both synthetic data and real data demonstrates the superior performance of the proposed method.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Political Science & Economics > Department of Statistics > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.