Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Visual Voice Activity Detection via Chaos Based Lip Motion Measure Robust under Illumination Changes

Full metadata record
DC Field Value Language
dc.contributor.authorSong, Taeyup-
dc.contributor.authorLee, Kyungsun-
dc.contributor.authorKo, Hanseok-
dc.date.accessioned2021-09-05T09:05:27Z-
dc.date.available2021-09-05T09:05:27Z-
dc.date.created2021-06-15-
dc.date.issued2014-05-
dc.identifier.issn0098-3063-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/98600-
dc.description.abstractIn this paper, a vision based voice activity detection (VVAD) algorithm is proposed using chaos theory. In conventional VVAD algorithm, the movement measure of lip region is found by applying an optical flow algorithm to detect the visual speech frame using a motion based energy feature set. However, since motion based feature is unstable under illumination changes, a new form of robust feature set is desirable. It is propositioned that contextual changes such as lip opening or closing motion during speech utterances under illumination variation can be observed as chaos-like and the resultant complex fractal trajectories in phase space can be observed. The fractality is measured in phase space from two sequential video input frames and subsequently any visual speech frames are robustly detected. Representative experiments are performed in image sequence containing a driver scene undergoing illumination fluctuations in moving vehicle environment. Experimental results indicate that a substantial improvement is obtained in terms of achieving significantly lower false alarm rate over the conventional method.(1)-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.subjectEND-POINT DETECTION-
dc.subjectSPEECH RECOGNITION-
dc.subjectALGORITHM-
dc.subjectENERGY-
dc.titleVisual Voice Activity Detection via Chaos Based Lip Motion Measure Robust under Illumination Changes-
dc.typeArticle-
dc.contributor.affiliatedAuthorKo, Hanseok-
dc.identifier.doi10.1109/TCE.2014.6852001-
dc.identifier.scopusid2-s2.0-84904602401-
dc.identifier.wosid000344364600012-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON CONSUMER ELECTRONICS, v.60, no.2, pp.251 - 257-
dc.relation.isPartOfIEEE TRANSACTIONS ON CONSUMER ELECTRONICS-
dc.citation.titleIEEE TRANSACTIONS ON CONSUMER ELECTRONICS-
dc.citation.volume60-
dc.citation.number2-
dc.citation.startPage251-
dc.citation.endPage257-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusEND-POINT DETECTION-
dc.subject.keywordPlusSPEECH RECOGNITION-
dc.subject.keywordPlusALGORITHM-
dc.subject.keywordPlusENERGY-
dc.subject.keywordAuthorChaos inspired motion feature-
dc.subject.keywordAuthorvoice activity detection-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Ko, Han seok photo

Ko, Han seok
공과대학 (전기전자공학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE