Mutual Information between Discrete Variables with Many Categories using Recursive Adaptive Partitioning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Seok, Junhee | - |
dc.contributor.author | Kang, Yeong Seon | - |
dc.date.accessioned | 2021-09-04T15:13:04Z | - |
dc.date.available | 2021-09-04T15:13:04Z | - |
dc.date.created | 2021-06-16 | - |
dc.date.issued | 2015-06-05 | - |
dc.identifier.issn | 2045-2322 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/93283 | - |
dc.description.abstract | Mutual information, a general measure of the relatedness between two random variables, has been actively used in the analysis of biomedical data. The mutual information between two discrete variables is conventionally calculated by their joint probabilities estimated from the frequency of observed samples in each combination of variable categories. However, this conventional approach is no longer efficient for discrete variables with many categories, which can be easily found in large-scale biomedical data such as diagnosis codes, drug compounds, and genotypes. Here, we propose a method to provide stable estimations for the mutual information between discrete variables with many categories. Simulation studies showed that the proposed method reduced the estimation errors by 45 folds and improved the correlation coefficients with true values by 99 folds, compared with the conventional calculation of mutual information. The proposed method was also demonstrated through a case study for diagnostic data in electronic health records. This method is expected to be useful in the analysis of various biomedical data with discrete variables. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | NATURE PUBLISHING GROUP | - |
dc.subject | GENE-EXPRESSION DATA | - |
dc.subject | ENTROPY | - |
dc.title | Mutual Information between Discrete Variables with Many Categories using Recursive Adaptive Partitioning | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Seok, Junhee | - |
dc.identifier.doi | 10.1038/srep10981 | - |
dc.identifier.scopusid | 2-s2.0-84930656104 | - |
dc.identifier.wosid | 000355866800001 | - |
dc.identifier.bibliographicCitation | SCIENTIFIC REPORTS, v.5 | - |
dc.relation.isPartOf | SCIENTIFIC REPORTS | - |
dc.citation.title | SCIENTIFIC REPORTS | - |
dc.citation.volume | 5 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Science & Technology - Other Topics | - |
dc.relation.journalWebOfScienceCategory | Multidisciplinary Sciences | - |
dc.subject.keywordPlus | GENE-EXPRESSION DATA | - |
dc.subject.keywordPlus | ENTROPY | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.