Collinear groupwise feature selection via discrete fusion group regression
- Authors
- Kim, Younghoon; Kim, Seoung Bum
- Issue Date
- Nov-2018
- Publisher
- ELSEVIER SCI LTD
- Keywords
- Multiple linear regression; Machine learning; Feature selection; Multicollinearity; Mixed-integer quadratic programming; Best subset selection
- Citation
- PATTERN RECOGNITION, v.83, pp.1 - 13
- Indexed
- SCIE
SCOPUS
- Journal Title
- PATTERN RECOGNITION
- Volume
- 83
- Start Page
- 1
- End Page
- 13
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/71978
- DOI
- 10.1016/j.patcog.2018.05.013
- ISSN
- 0031-3203
- Abstract
- We propose a method to select the subset of features in multiple linear regression models that considers the collinearity between features. The proposed method first detects collinear groups of features and then uses collinear groupwise feature selection constraints to estimate the coefficients of the regression model. The constraints simultaneously control the number of features selected and predefined collinear feature groups. We manage the multicollinearity in the regression model by controlling the parameters of the fusion group constraint. To address the NP-hard problem of the proposed method, we propose a modified discrete first-order algorithm. We use simulation and real-world data to demonstrate the usefulness of the proposed method by comparing it to existing regularization and discrete optimization-based methods in terms of predictive accuracy, bias, and variance. The comparison confirms that the proposed method outperforms the alternatives. (C) 2018 Elsevier Ltd. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.