Visual Speech Recognition Using Weighted Dynamic Time Warping
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Kyungsun | - |
dc.contributor.author | Keum, Minseok | - |
dc.contributor.author | Han, David K. | - |
dc.contributor.author | Ko, Hanseok | - |
dc.date.accessioned | 2021-09-04T14:56:31Z | - |
dc.date.available | 2021-09-04T14:56:31Z | - |
dc.date.created | 2021-06-16 | - |
dc.date.issued | 2015-07 | - |
dc.identifier.issn | 1745-1361 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/93208 | - |
dc.description.abstract | It is unclear whether Hidden Markov Model (HMM) or Dynamic Time Warping (DTW) mapping is more appropriate for visual speech recognition when only small data samples are available. In this letter, the two approaches are compared in terms of sensitivity to the amount of training samples and computing time, with the objective of determining the tipping point. The limited-training-data problem is addressed by exploiting straightforward template matching via weighted DTW (WDTW). The proposed framework is a refined DTW that adjusts the warping paths with judiciously injected weights to ensure a smooth diagonal path for accurate alignment without added computational load. The proposed WDTW is evaluated on three databases (two in the public domain and one developed in-house) for visual recognition performance. Subsequent experiments indicate that the proposed WDTW significantly enhances the recognition rate compared to DTW- and HMM-based algorithms, especially under limited data samples. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG | - |
dc.title | Visual Speech Recognition Using Weighted Dynamic Time Warping | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Ko, Hanseok | - |
dc.identifier.doi | 10.1587/transinf.2015EDL8002 | - |
dc.identifier.scopusid | 2-s2.0-84937597424 | - |
dc.identifier.wosid | 000359474300024 | - |
dc.identifier.bibliographicCitation | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, v.E98D, no.7, pp.1430 - 1433 | - |
dc.relation.isPartOf | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | - |
dc.citation.title | IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | - |
dc.citation.volume | E98D | - |
dc.citation.number | 7 | - |
dc.citation.startPage | 1430 | - |
dc.citation.endPage | 1433 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | - |
dc.subject.keywordAuthor | visual speech recognition | - |
dc.subject.keywordAuthor | lip reading | - |
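The abstract describes WDTW as a DTW variant whose warping paths are steered toward the diagonal by injected weights. As a rough illustration only, the sketch below implements a generic step-weighted DTW in the style of Sakoe–Chiba, where off-diagonal (insertion/deletion) steps carry a larger weight than diagonal matches; the function name, the specific weighting scheme, and the parameter values are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

def weighted_dtw(x, y, w_diag=1.0, w_side=1.5):
    """Generic step-weighted DTW distance (illustrative, not the paper's exact scheme).

    Diagonal steps are charged w_diag * local cost; horizontal/vertical
    steps are charged w_side * local cost, so w_side > w_diag biases the
    optimal warping path toward a smooth diagonal alignment.
    """
    n, m = len(x), len(y)
    D = np.full((n + 1, m + 1), np.inf)  # cumulative cost matrix
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(x[i - 1] - y[j - 1])  # local frame distance
            D[i, j] = min(
                D[i - 1, j - 1] + w_diag * d,  # diagonal step (match)
                D[i - 1, j] + w_side * d,      # vertical step (insertion)
                D[i, j - 1] + w_side * d,      # horizontal step (deletion)
            )
    return D[n, m]
```

With identical sequences the optimal path is purely diagonal and the distance is zero; increasing `w_side` penalizes any alignment that strays from the diagonal, which is the qualitative behavior the abstract attributes to WDTW.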