Detailed Information


The estimation of probability distribution for factor variables with many categorical values

Authors
Lee, Minhyeok; Kang, Yeong Seon; Seok, Junhee
Issue Date
24-Aug-2018
Publisher
PUBLIC LIBRARY SCIENCE
Citation
PLOS ONE, v.13, no.8
Indexed
SCIE
SCOPUS
Journal Title
PLOS ONE
Volume
13
Number
8
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/73731
DOI
10.1371/journal.pone.0202547
ISSN
1932-6203
Abstract
With recent developments in biomedical data technology, factor data such as diagnosis codes and genomic features, which can have tens to hundreds of discrete, unorderable categorical values, have emerged. Although it is a fundamental problem in statistical analysis, the estimation of the probability distribution of such factor variables has not been studied much, because previous studies have mainly focused on continuous variables and on discrete factor variables with only a few categories, such as sex and race. In this work, we propose a nonparametric Bayesian procedure to estimate the probability distribution of factors with many categories. The proposed method was demonstrated through simulation studies under various conditions and showed significant reductions in estimation error compared with conventional methods. In addition, the method was applied to the analysis of diagnosis data from intensive care unit patients and generated interesting medical hypotheses. Overall, the results indicate that the proposed method will be useful in the analysis of biomedical factor data.
Appears in Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles


Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

SEOK, Jun hee
College of Engineering (School of Electrical Engineering)
