Visual Voice Activity Detection via Chaos Based Lip Motion Measure Robust under Illumination Changes
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Song, Taeyup | - |
dc.contributor.author | Lee, Kyungsun | - |
dc.contributor.author | Ko, Hanseok | - |
dc.date.accessioned | 2021-09-05T09:05:27Z | - |
dc.date.available | 2021-09-05T09:05:27Z | - |
dc.date.created | 2021-06-15 | - |
dc.date.issued | 2014-05 | - |
dc.identifier.issn | 0098-3063 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/98600 | - |
dc.description.abstract | In this paper, a vision-based voice activity detection (VVAD) algorithm based on chaos theory is proposed. Conventional VVAD algorithms measure the movement of the lip region by applying an optical flow algorithm, and detect visual speech frames using a motion-based energy feature set. However, since motion-based features are unstable under illumination changes, a more robust feature set is desirable. It is proposed that contextual changes, such as lip opening or closing motion during speech utterances under illumination variation, can be observed as chaos-like, producing complex fractal trajectories in phase space. The fractality is measured in phase space from two sequential video input frames, and visual speech frames are then robustly detected. Representative experiments are performed on image sequences of a driver scene undergoing illumination fluctuations in a moving vehicle environment. Experimental results indicate a substantial improvement over the conventional method, in the form of a significantly lower false alarm rate. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.subject | END-POINT DETECTION | - |
dc.subject | SPEECH RECOGNITION | - |
dc.subject | ALGORITHM | - |
dc.subject | ENERGY | - |
dc.title | Visual Voice Activity Detection via Chaos Based Lip Motion Measure Robust under Illumination Changes | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Ko, Hanseok | - |
dc.identifier.doi | 10.1109/TCE.2014.6852001 | - |
dc.identifier.scopusid | 2-s2.0-84904602401 | - |
dc.identifier.wosid | 000344364600012 | - |
dc.identifier.bibliographicCitation | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, v.60, no.2, pp.251 - 257 | - |
dc.relation.isPartOf | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | - |
dc.citation.title | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | - |
dc.citation.volume | 60 | - |
dc.citation.number | 2 | - |
dc.citation.startPage | 251 | - |
dc.citation.endPage | 257 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordPlus | END-POINT DETECTION | - |
dc.subject.keywordPlus | SPEECH RECOGNITION | - |
dc.subject.keywordPlus | ALGORITHM | - |
dc.subject.keywordPlus | ENERGY | - |
dc.subject.keywordAuthor | Chaos inspired motion feature | - |
dc.subject.keywordAuthor | voice activity detection | - |