A Novel Computational Method for Biomedical Binary Data Analysis: Development of a Thyroid Disease Index Using a Brute-Force Search with MLR Analysis
- Authors
- Lee, J.K.; Han, W.S.; Lee, J.-S.; Yoon, C.N.
- Issue Date
- 2017
- Publisher
- Wiley Blackwell
- Keywords
- Androgen; Brute-force search; Estrogen; Multiple linear regression; Thyroid disease index
- Citation
- Bulletin of the Korean Chemical Society, v.38, no.12, pp.1392 - 1397
- Indexed
- SCIE; SCOPUS; KCI
- Journal Title
- Bulletin of the Korean Chemical Society
- Volume
- 38
- Number
- 12
- Start Page
- 1392
- End Page
- 1397
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/86146
- DOI
- 10.1002/bkcs.11308
- ISSN
- 0253-2964
- Abstract
- The thyroid disease index (TDI), which estimates thyroid disease progress from hormone concentration measurements and changes in hormone patterns, was developed. In this study, we measured hormone concentration profiles in the androgen and estrogen metabolic pathways of 23 patients with thyroid disease and 20 unaffected people. Using t-tests, we showed that the hormones 2-hydroxyestrone (2-OH-E1), 2-hydroxyestradiol (2-OH-E2), 2-methoxyestrone (2-MeO-E1), 2-methoxyestradiol (2-MeO-E2), and 2-methoxyestradiol-3-methylether (2-MeO-E2-3-methylether) are related to the development of thyroid disease. Although the concentrations of these hormones generally increase as the disease progresses, large fluctuations make it difficult to determine disease progress by measuring hormone levels. The differing patterns between the correlation matrices of the disease and control groups possibly indicate changes in hormone release patterns as thyroid disease progresses. Because progressive experimental data on thyroid disease were lacking, binary data for the two categories (the thyroid disease patients and the control group) were used. Binary logistic regression was used to analyze five risk factors associated with thyroid disease, and the highest overall accuracy was 97.7% with three risk factors. Logistic regression models, however, are unable to describe disease progress. Hence, the TDI was developed to estimate thyroid disease progress. An arbitrary ranking of disease progress was generated for the TDI equation. The ranking contained a total of 29 030 400 entries with six stages from the control group and eight stages from the disease group. Multiple linear regression (MLR) analysis was performed with a brute-force search. The best result among the MLR runs showed a strong correlation (r² of 0.840 and q² of 0.663) between the selected hormones and the disease progress values in the training set. The overall accuracy of our novel method was 90.7%, which is lower than the 97.7% of the logistic regression models. Brute-force search with MLR analysis might distinguish different types of thyroid disease progress, such as thyroid mass (0.8055), goiter (0.8806), thyroid mass that was a thyroid cancer before operation (0.8951 and 0.9112), and cancer (1.001–2.144). The results show that the TDI is a good indicator of thyroid disease progress and that brute-force search with MLR analysis is useful for biomedical binary data analysis. © 2017 Korean Chemical Society, Seoul & Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
- Appears in Collections
- College of Medicine > Department of Medical Science > 1. Journal Articles
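The abstract describes a brute-force search over arbitrary rankings of disease progress, with an MLR fit for each candidate. The following is a minimal sketch of one possible reading of that idea, not the authors' implementation. It assumes that each sample is pre-assigned to a stage slot, that a candidate ranking permutes the progress values within the control slots and within the disease slots separately, and that candidates are scored by a leave-one-out q². Under that reading, six control stages and eight disease stages give 6! × 8! = 29 030 400 candidates, which matches the count in the abstract. The data are synthetic and all names (`r2_q2`, `brute_force_rankings`, `slot`) are hypothetical. The toy example uses three stages per group to keep the search at 36 candidates.

```python
import itertools
import math

import numpy as np


def r2_q2(X, y):
    """Fit y on X by least squares (with intercept); return (r2, q2).

    q2 is taken here as 1 - PRESS / SS_total, with leave-one-out
    residuals computed from the hat-matrix diagonal (an assumption;
    the source does not define q2).
    """
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    ss_tot = np.sum((y - y.mean()) ** 2)
    Q, _ = np.linalg.qr(A)            # reduced QR; leverage h_ii = sum_j Q_ij^2
    leverage = np.sum(Q ** 2, axis=1)
    press = np.sum((resid / (1.0 - leverage)) ** 2)
    return 1.0 - np.sum(resid ** 2) / ss_tot, 1.0 - press / ss_tot


def brute_force_rankings(X, slot, n_ctrl, n_dis):
    """Try every within-group ordering of stage values; keep the best q2."""
    best, tried = None, 0
    for pc in itertools.permutations(range(n_ctrl)):
        for pd in itertools.permutations(range(n_ctrl, n_ctrl + n_dis)):
            mapping = pc + pd                      # progress value per slot
            y = np.array(mapping, dtype=float)[slot]
            r2, q2 = r2_q2(X, y)
            tried += 1
            if best is None or q2 > best[0]:
                best = (q2, r2, mapping)
    return best + (tried,)


# Synthetic stand-in data (NOT from the paper): 6 slots, 4 samples each,
# 3 "hormone" features that track a hidden ordering plus noise.
rng = np.random.default_rng(0)
slot = np.repeat(np.arange(6), 4)
hidden = np.array([1, 0, 2, 4, 5, 3], dtype=float)[slot]
X = hidden[:, None] * np.array([1.0, 0.5, -0.8]) + rng.normal(0.0, 0.3, (24, 3))

best_q2, best_r2, best_map, tried = brute_force_rankings(X, slot, 3, 3)
print(tried, best_map, round(best_r2, 3), round(best_q2, 3))

# Size of the full search under the same reading: 6! * 8!
full_search = math.factorial(6) * math.factorial(8)
```

Scoring by q² rather than r² is a design choice in this sketch: with tens of millions of candidate rankings, a cross-validated score is less prone to rewarding a ranking that merely fits noise in the training set.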