Robust facial landmark extraction scheme using multiple convolutional neural networks
- Authors
- Kim, Hyungjoon; Park, Jisoo; Kim, HyeonWoo; Hwang, Eenjun; Rho, Seungmin
- Issue Date
- 2월-2019
- Publisher
- SPRINGER
- Keywords
- Convolutional neural networks; Facial landmark; Semantic segmentation; Object detection; Faster R-CNN
- Citation
- MULTIMEDIA TOOLS AND APPLICATIONS, v.78, no.3, pp.3221 - 3238
- Indexed
- SCIE
SCOPUS
- Journal Title
- MULTIMEDIA TOOLS AND APPLICATIONS
- Volume
- 78
- Number
- 3
- Start Page
- 3221
- End Page
- 3238
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/67856
- DOI
- 10.1007/s11042-018-6482-7
- ISSN
- 1380-7501
- Abstract
- Facial landmarks are a set of features that can be distinguished on the human face with the naked eye. Typical facial landmarks include eyes, eyebrows, nose, and mouth. Landmarks play an important role in human-related image analysis. For example, they can be used to determine whether there is a human being in the image, identify who the person is, or recognize the orientation of a face when taking a photograph. General techniques for detecting facial landmarks can be classified into two groups: One is based on traditional image processing techniques, such as Haar cascade classifiers and edge detection. The other is based on machine learning techniques in which landmarks can be detected by training neural network using facial features. However, such techniques have shown low accuracy, especially in some special conditions such as low luminance and overlapped faces. To overcome these problems, we proposed in our previous work a facial landmark extraction scheme using deep learning and semantic segmentation, and demonstrated that with even a small dataset, our scheme could achieve reasonable facial landmark extraction performance under such conditions. Nevertheless, for more extensive dataset, we found several exceptional cases where the scheme could not detect face landmarks precisely. Hence, in this paper, we revise our facial landmark extraction scheme using a deep learning model called Faster R-CNN and show how our scheme can improve the overall performance by handling such exceptional cases appropriately. Also, we show how to expand the training dataset by using image filters and image operations such as rotation for more robust landmark detection.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Engineering > School of Electrical Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.