Support Vector Machine Using k-Spatial Medians Clustering and Recovery Process
- Authors
- Bang, Sungwan; Koo, Ja-Yong; Jhun, Myoungshic
- Issue Date
- 2010
- Publisher
- TAYLOR & FRANCIS INC
- Keywords
- Classification; k-means clustering; k-spatial medians clustering; Outlier; Recovery process; Support vector machine
- Citation
- COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, v.39, no.7, pp.1422 - 1434
- Indexed
- SCIE
SCOPUS
- Journal Title
- COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION
- Volume
- 39
- Number
- 7
- Start Page
- 1422
- End Page
- 1434
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/118596
- DOI
- 10.1080/03610918.2010.498642
- ISSN
- 0361-0918
- Abstract
- Even though support vector machine has been successfully applied to various classification problems with its flexibility and high classification accuracy, it is not suitable for classification of large data sets because its computational complexity grows rapidly as the size of data set increases. SVM using k-means clustering (KM-SVM) is a fast algorithm which has been developed to accelerate both computation and prediction of SVM classifiers. However, it seems likely that the data set is contaminated by outliers in real-world situations, and k-means clustering is sensitive to these outliers. Therefore, we propose to combine k-spatial medians clustering with SVM (KS-SVM) since k-spatial medians clustering is robust for outliers. In order to improve the classification accuracy in KS-SVM, furthermore, a recovery process based on KS-SVM (RKS-SVM) is also proposed in this article. Experiments show that KS-SVM can improve the performance of KM-SVM in terms of classification accuracy and number of support vector. It is also shown that the classification accuracy of RKS-SVM is better than one of KS-SVM, but there is a tradeoff between classification accuracy and the number of support vectors.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Political Science & Economics > Department of Statistics > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.