Correlated variable importance for random forests
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Shin, Seung Beom | - |
dc.contributor.author | Cho, Hyung Jun | - |
dc.date.accessioned | 2022-03-04T06:40:58Z | - |
dc.date.available | 2022-03-04T06:40:58Z | - |
dc.date.created | 2021-12-07 | - |
dc.date.issued | 2021-04 | - |
dc.identifier.issn | 1225-066X | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/137712 | - |
dc.description.abstract | Random forests is a popular method that improves the instability and accuracy of decision trees by ensembles. In contrast to increasing the accuracy, the ease of interpretation is sacrificed; hence, to compensate for this, variable importance is provided. The variable importance indicates which variable plays a role more importantly in constructing the random forests. However, when a predictor is correlated with other predictors, the variable importance of the existing importance algorithm may be distorted. The downward bias of correlated predictors may reduce the importance of truly important predictors. We propose a new algorithm remedying the downward bias of correlated predictors. The performance of the proposed algorithm is demonstrated by the simulated data and illustrated by the real data. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
dc.publisher | KOREAN STATISTICAL SOC | - |
dc.title | Correlated variable importance for random forests | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Cho, Hyung Jun | - |
dc.identifier.doi | 10.5351/KJAS.2021.34.2.177 | - |
dc.identifier.wosid | 000668581100005 | - |
dc.identifier.bibliographicCitation | KOREAN JOURNAL OF APPLIED STATISTICS, v.34, no.2, pp.177 - 190 | - |
dc.relation.isPartOf | KOREAN JOURNAL OF APPLIED STATISTICS | - |
dc.citation.title | KOREAN JOURNAL OF APPLIED STATISTICS | - |
dc.citation.volume | 34 | - |
dc.citation.number | 2 | - |
dc.citation.startPage | 177 | - |
dc.citation.endPage | 190 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.identifier.kciid | ART002712836 | - |
dc.description.journalClass | 2 | - |
dc.description.journalRegisteredClass | kci | - |
dc.relation.journalResearchArea | Mathematics | - |
dc.relation.journalWebOfScienceCategory | Statistics & Probability | - |
dc.subject.keywordAuthor | correlation | - |
dc.subject.keywordAuthor | random forests | - |
dc.subject.keywordAuthor | variable importance | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.