Composite large margin classifiers with latent subclasses for heterogeneous biomedical data

Chen, Guanhua; Liu, Yufeng; Shen, Dinggang; Kosorok, Michael R.

doi:10.1002/sam.11300

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Composite large margin classifiers with latent subclasses for heterogeneous biomedical data

Authors: Chen, Guanhua; Liu, Yufeng; Shen, Dinggang; Kosorok, Michael R.

Issue Date: 4월-2016

Publisher: WILEY

Keywords: classification; large margin; latent subclasses; principal component analysis

Citation: STATISTICAL ANALYSIS AND DATA MINING, v.9, no.2, pp.75 - 88

Indexed: SCIE
SCOPUS

Journal Title: STATISTICAL ANALYSIS AND DATA MINING

Volume: 9

Number: 2

Start Page: 75

End Page: 88

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/89010

DOI: 10.1002/sam.11300

ISSN: 1932-1872

Abstract: High-dimensional classification problems are prevalent in a wide range of modern scientific applications. Despite a large number of candidate classification techniques available to use, practitioners often face a dilemma of choosing between linear and general nonlinear classifiers. Specifically, simple linear classifiers have good interpretability, but may have limitations in handling data with complex structures. In contrast, general nonlinear classifiers are more flexible, but may lose interpretability and have higher tendency for overfitting. In this paper, we consider data with potential latent subgroups in the classes of interest. We propose a new method, namely the composite large margin (CLM) classifier, to address the issue of classification with latent subclasses. The CLM aims to find three linear functions simultaneously: one linear function to split the data into two parts, with each part being classified by a different linear classifier. Our method has comparable prediction accuracy to a general nonlinear classifier, and it maintains the interpretability of traditional linear classifiers. We demonstrate the competitive performance of the CLM through comparisons with several existing linear and nonlinear classifiers by Monte Carlo experiments. Analysis of the Alzheimer's disease classification problem using CLM not only provides a lower classification error in discriminating cases and controls, but also identifies subclasses in controls that are more likely to develop the disease in the future.

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Artificial Intelligence > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :9,536,682; Today View :166

RSS_1.0 RSS_2.0 ATOM_1.0

(02841) 서울특별시 성북구 안암로 14502-3290-1114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE