Performance of deep learning to detect mastoiditis using multiple conventional radiographs of mastoid
- Authors
- Lee, Kyong Joon; Ryoo, Inseon; Choi, Dongjun; Sunwoo, Leonard; You, Sung-Hye; Jung, Hye Na
- Issue Date
- 11-11월-2020
- Publisher
- PUBLIC LIBRARY SCIENCE
- Citation
- PLOS ONE, v.15, no.11
- Indexed
- SCIE
SCOPUS
- Journal Title
- PLOS ONE
- Volume
- 15
- Number
- 11
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/51816
- DOI
- 10.1371/journal.pone.0241796
- ISSN
- 1932-6203
- Abstract
- Objectives This study aimed to compare the diagnostic performance of deep learning algorithm trained by single view (anterior-posterior (AP) or lateral view) with that trained by multiple views (both views together) in diagnosis of mastoiditis on mastoid series and compare the diagnostic performance between the algorithm and radiologists. Methods Total 9,988 mastoid series (AP and lateral views) were classified as normal or abnormal (mastoiditis) based on radiographic findings. Among them 792 image sets with temporal bone CT were classified as the gold standard test set and remaining sets were randomly divided into training (n = 8,276) and validation (n = 920) sets by 9:1 for developing a deep learning algorithm. Temporal (n = 294) and geographic (n = 308) external test sets were also collected. Diagnostic performance of deep learning algorithm trained by single view was compared with that trained by multiple views. Diagnostic performance of the algorithm and two radiologists was assessed. Inter-observer agreement between the algorithm and radiologists and between two radiologists was calculated. Results Area under the receiver operating characteristic curves of algorithm using multiple views (0.971, 0.978, and 0.965 for gold standard, temporal, and geographic external test sets, respectively) showed higher values than those using single view (0.964/0.953, 0.952/0.961, and 0.961/0.942 for AP view/lateral view of gold standard, temporal external, and geographic external test sets, respectively) in all test sets. The algorithm showed statistically significant higher specificity compared with radiologists (p = 0.018 and 0.012). There was substantial agreement between the algorithm and two radiologists and between two radiologists (kappa = 0.79, 0.8, and 0.76). Conclusion The deep learning algorithm trained by multiple views showed better performance than that trained by single view. The diagnostic performance of the algorithm for detecting mastoiditis on mastoid series was similar to or higher than that of radiologists.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Medicine > Department of Medical Science > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.