Weighted Support Vector Machine Using k-Means Clustering
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Bang, Sungwan | - |
dc.contributor.author | Jhun, Myoungshic | - |
dc.date.accessioned | 2021-09-05T02:49:07Z | - |
dc.date.available | 2021-09-05T02:49:07Z | - |
dc.date.created | 2021-06-15 | - |
dc.date.issued | 2014-11-26 | - |
dc.identifier.issn | 0361-0918 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/96735 | - |
dc.description.abstract | The support vector machine (SVM) has been successfully applied to various classification areas with great flexibility and a high level of classification accuracy. However, the SVM is not suitable for the classification of large or imbalanced datasets because of significant computational problems and a classification bias toward the dominant class. The SVM combined with the k-means clustering (KM-SVM) is a fast algorithm developed to accelerate both the training and the prediction of SVM classifiers by using the cluster centers obtained from the k-means clustering. In the KM-SVM algorithm, however, the penalty of misclassification is treated equally for each cluster center even though the contributions of different cluster centers to the classification can be different. In order to improve classification accuracy, we propose the WKM-SVM algorithm which imposes different penalties for the misclassification of cluster centers by using the number of data points within each cluster as a weight. As an extension of the WKM-SVM, the recovery process based on WKM-SVM is suggested to incorporate the information near the optimal boundary. Furthermore, the proposed WKM-SVM can be successfully applied to imbalanced datasets with an appropriate weighting strategy. Experiments show the effectiveness of our proposed methods. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | TAYLOR & FRANCIS INC | - |
dc.subject | CLASSIFICATION | - |
dc.title | Weighted Support Vector Machine Using k-Means Clustering | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Jhun, Myoungshic | - |
dc.identifier.doi | 10.1080/03610918.2012.762388 | - |
dc.identifier.scopusid | 2-s2.0-84902654270 | - |
dc.identifier.wosid | 000337961200007 | - |
dc.identifier.bibliographicCitation | COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, v.43, no.10, pp.2307 - 2324 | - |
dc.relation.isPartOf | COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION | - |
dc.citation.title | COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION | - |
dc.citation.volume | 43 | - |
dc.citation.number | 10 | - |
dc.citation.startPage | 2307 | - |
dc.citation.endPage | 2324 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Mathematics | - |
dc.relation.journalWebOfScienceCategory | Statistics & Probability | - |
dc.subject.keywordPlus | CLASSIFICATION | - |
dc.subject.keywordAuthor | Classification | - |
dc.subject.keywordAuthor | Class imbalance | - |
dc.subject.keywordAuthor | k-means clustering | - |
dc.subject.keywordAuthor | Recovery process | - |
dc.subject.keywordAuthor | Weighted support vector machine | - |
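
The weighting idea in the abstract (cluster each class, keep only the cluster centers, and penalize the misclassification of each center in proportion to the number of points in its cluster) can be sketched in a few lines. This is a minimal illustration, not the authors' implementation. Everything the abstract does not state is an assumption: clustering each class separately, the linear kernel, the subgradient-descent solver, the rescaling of weights to mean 1, the synthetic data, and all function names (`kmeans`, `fit_weighted_linear_svm`, `predict`) and parameter values. The recovery process and the imbalanced-data weighting strategy are not covered.

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centers, sizes) of the non-empty clusters."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest center (squared Euclidean distance).
            j = min(range(k),
                    key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            groups[j].append(p)
        # Move each center to the mean of its cluster; keep it in place if empty.
        centers = [tuple(sum(col) / len(g) for col in zip(*g)) if g else centers[i]
                   for i, g in enumerate(groups)]
    kept = [(c, len(g)) for c, g in zip(centers, groups) if g]
    return [c for c, _ in kept], [n for _, n in kept]

def fit_weighted_linear_svm(X, y, weights, C=1.0, lr=0.01, epochs=2000):
    """Minimize 0.5*||w||^2 + C * sum_i weights[i] * hinge(y_i * (w.x_i + b)).

    Assumption: subgradient descent stands in for a quadratic-programming solver.
    """
    d = len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = list(w), 0.0  # gradient of the 0.5*||w||^2 term
        for xi, yi, si in zip(X, y, weights):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # hinge loss is active: add its weighted subgradient
                for j in range(d):
                    gw[j] -= C * si * yi * xi[j]
                gb -= C * si * yi
        w = [wj - lr * gj for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w, b

def predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Synthetic two-class data (an assumption made purely for illustration).
rng = random.Random(1)
pos = [(rng.gauss(2, 0.5), rng.gauss(2, 0.5)) for _ in range(100)]
neg = [(rng.gauss(-2, 0.5), rng.gauss(-2, 0.5)) for _ in range(100)]

# Cluster each class separately so that every center carries a single label.
centers, labels, sizes = [], [], []
for pts, label in ((pos, 1), (neg, -1)):
    c, n = kmeans(pts, k=3)
    centers += c
    labels += [label] * len(c)
    sizes += n

# Weight each center by its cluster size, rescaled to mean 1 (assumption).
mean_size = sum(sizes) / len(sizes)
weights = [n / mean_size for n in sizes]

# Train on the weighted centers only, then evaluate on the full dataset.
w, b = fit_weighted_linear_svm(centers, labels, weights)
correct = sum(predict(w, b, p) == 1 for p in pos) + sum(predict(w, b, p) == -1 for p in neg)
accuracy = correct / (len(pos) + len(neg))
print(f"trained on {len(centers)} centers, accuracy on all points: {accuracy:.3f}")
```

Setting every entry of `weights` to 1 gives the unweighted variant that the abstract calls KM-SVM, which makes the two easy to compare on the same centers.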