DuETNet: Dual Encoder based Transfer Network for thoracic disease classification
- Authors
- Lee, Min Seok; Han, Sung Won
- Issue Date
- 9월-2022
- Publisher
- ELSEVIER
- Keywords
- Thoracic disease classification; Imbalanced multi -class classification; Convolutional neural networks; Resolution calibration; Entropy based label smoothing; Attention mechanism
- Citation
- PATTERN RECOGNITION LETTERS, v.161, pp.143 - 153
- Indexed
- SCIE
SCOPUS
- Journal Title
- PATTERN RECOGNITION LETTERS
- Volume
- 161
- Start Page
- 143
- End Page
- 153
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/145827
- DOI
- 10.1016/j.patrec.2022.08.007
- ISSN
- 0167-8655
- Abstract
- In thoracic disease classification, the original chest X-ray images are high resolution images. Nevertheless, in existing convolution neural network (CNN) models, the original images are resized to 224 x 224 before use. Diseases in local areas may not be sufficiently represented because the chest X-ray images have been resized, which compresses information excessively. Therefore, a higher resolution is required to focus on the local representations. Although the large input resolution reduces memory efficiency, previous studies have investigated using CNNs with the large input for classification performance improvement. Moreover, optimization for imbalanced classes is required because chest X-ray images have highly imbalanced pathology labels. Hence, this study proposes the Dual Encoder based Transfer Network (DuETNet) to counter the inefficiency caused by large input resolution and improve classification performance by adjusting the input size based on the RandomResizedCrop method. This image transformation method crops a random area of a given image and resizes it to a given size. Thus, a resolution calibration guideline is a practical way to achieve memory efficiency and performance gains under restricted resources by adjusting the scale factor a on the training and test images. To treat high class imbalance, we propose entropy based label smoothing method. The method enhances generalization performance for the imbalanced minor classes by penalizing the major classes. The dual encoder comprises channel and spatial encoders, which apply channel-and spatial-wise attention to enhance the relatively significant features from the adjusted images. To evaluate the performance of DuETNet, we used the ChestX-ray14 and MIMIC-CXR-JPG datasets, and DuETNet achieved a new state-of-the-art method. (C) 2022 Elsevier B.V. All rights reserved.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Industrial and Management Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.