Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Machine Learning for the Prediction of New-Onset Diabetes Mellitus during 5-Year Follow-up in Non-Diabetic Patients with Cardiovascular Risks

Authors
Choi, Byoung GeolRha, Seung-WoonKim, Suhng WookKang, Jun HyukPark, Ji YoungNoh, Yung-Kyun
Issue Date
2월-2019
Publisher
YONSEI UNIV COLL MEDICINE
Keywords
Type 2 diabetes mellitus; diabetes; machine learning; prediction; big data
Citation
YONSEI MEDICAL JOURNAL, v.60, no.2, pp.191 - 199
Indexed
SCIE
SCOPUS
KCI
Journal Title
YONSEI MEDICAL JOURNAL
Volume
60
Number
2
Start Page
191
End Page
199
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/67811
DOI
10.3349/ymj.2019.60.2.191
ISSN
0513-5796
Abstract
Purpose: Many studies have proposed predictive models for type 2 diabetes mellitus (T2DM). However, these predictive models have several limitations, such as user convenience and reproducibility. The purpose of this study was to develop a T2DM predictive model using electronic medical records (EMRs) and machine learning and to compare the performance of this model with traditional statistical methods. Materials and Methods: In this study, a total of available 8454 patients who had no history of diabetes and were treated at the cardiovascular center of Korea University Gum Hospital were enrolled. All subjects completed 5 years of follow up. The prevalence of T2DM during follow up was 4.78% (404/8454). A total of 28 variables were extracted from the EMRs. In order to verify the cross-validation test according to the prediction model, logistic regression (LR), linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and K-nearest neighbor (KNN) algorithm models were generated. The LR model was considered as the existing statistical analysis method. Results: All predictive models maintained a change within the standard deviation of area under the curve (AUC) <0.01 in the analysis after a 10-fold cross-validation test. Among all predictive models, the LR learning model showed the highest prediction performance, with an AUC of 0.78. However, compared to the LR model, the LDA, QDA, and KNN models did not show a statistically significant difference. Conclusion: We successfully developed and verified a T2DM prediction system using machine learning and an EMR database, and it predicted the 5-year occurrence of T2DM similarly to with a traditional prediction model. In further study, it is necessary to apply and verify the prediction model through clinical research.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Biomedical Sciences > 1. Journal Articles
College of Health Sciences > School of Health and Environmental Science > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Suhng Wook photo

Kim, Suhng Wook
보건과학대학 (보건환경융합과학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE