Semi-Supervised Discriminative Classification Robust to Sample-Outliers and Feature-Noises
- Authors
- Adeli, Ehsan; Thung, Kim-Han; An, Le; Wu, Guorong; Shi, Feng; Wang, Tao; Shen, Dinggang
- Issue Date
- Feb-2019
- Publisher
- IEEE COMPUTER SOC
- Keywords
- Linear discriminant analysis; semi-supervised learning; robust classification; feature selection; sample outlier detection; Alzheimer's disease; Parkinson's disease; biomarker identification; disease diagnosis; nuclear norm; regularization
- Citation
- IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, v.41, no.2, pp.515 - 522
- Indexed
- SCIE; SCOPUS
- Journal Title
- IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
- Volume
- 41
- Number
- 2
- Start Page
- 515
- End Page
- 522
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/67844
- DOI
- 10.1109/TPAMI.2018.2794470
- ISSN
- 0162-8828
- Abstract
- Discriminative methods commonly produce models with relatively good generalization abilities. However, this advantage is challenged in real-world applications (e.g., medical image analysis problems), in which there often exist outlier data points (sample-outliers) and noise in the predictor values (feature-noises). Methods robust to both types of deviation are somewhat overlooked in the literature. We further argue that denoising can be more effective if we learn the model using all the available labeled and unlabeled samples, as the intrinsic geometry of the sample manifold can be better constructed using more data points. In this paper, we propose a semi-supervised robust discriminative classification method based on the least-squares formulation of linear discriminant analysis to detect sample-outliers and feature-noises simultaneously, using both labeled training and unlabeled testing data. We conduct several experiments on a synthetic dataset, several benchmark semi-supervised learning datasets, and two brain neurodegenerative disease diagnosis datasets (for Parkinson's and Alzheimer's diseases). For neurodegenerative disease diagnosis specifically, incorporating robust machine learning methods can be of great benefit, due to the noisy nature of neuroimaging data. Our results show that our method outperforms the baseline and several state-of-the-art methods in terms of both accuracy and the area under the ROC curve.
- Appears in Collections
- Graduate School > Department of Artificial Intelligence > 1. Journal Articles