A non-parametric method for data clustering with optimal variable weighting
- Authors
- Chung, Ji-Won; Choi, In-Chan
- Issue Date
- 2006
- Publisher
- SPRINGER-VERLAG BERLIN
- Citation
- INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS, v.4224, pp.807 - 814
- Indexed
- SCIE
SCOPUS
- Journal Title
- INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2006, PROCEEDINGS
- Volume
- 4224
- Start Page
- 807
- End Page
- 814
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/123178
- ISSN
- 0302-9743
- Abstract
- Since cluster analysis in data mining often deals with large-scale high-dimensional data with masking variables, it is important to remove non-contributing variables for accurate cluster recovery and also for proper interpretation of clustering results. Although the weights obtained by variable weighting methods can be used for the purpose of variable selection (or, elimination), they alone hardly provide a clear guide on selecting variables for subsequent analysis. In addition, variable selection and variable weighting are highly interrelated with the choice on the number of clusters. In this paper, we propose a non-parametric data clustering method, based on the W-k-means type clustering, for an automated and joint decision on selecting variables, determining variable weights, and deciding the number of clusters. Conclusions are drawn from computational experiments with random data and real-life data.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.