Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Real-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync

Full metadata record
DC Field Value Language
dc.contributor.authorPark, Junho-
dc.contributor.authorKo, Hanseok-
dc.date.accessioned2021-09-09T02:50:09Z-
dc.date.available2021-09-09T02:50:09Z-
dc.date.created2021-06-10-
dc.date.issued2008-11-
dc.identifier.issn1520-9210-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/122424-
dc.description.abstractThis work describes a real-time lip-sync method using which an avatar's lip shape is synchronized with the corresponding speech signal. Phoneme recognition is generally regarded as an important task in the operation of a real-time lip-sync system. In this work, the use of the Head-Body-Tail (HBT) model is proposed for the purpose of more efficiently recognizing phonemes which are variously uttered due to co-articulation effects. The HBT model effectively deals with the transition parts of context-dependent models for small-sized vocabulary tasks. These models provide better recognition performance than general context-dependent or context-independent models for the task of digit or vowel recognition. Moreover, each phoneme is categorized into one among four classes and the class-dependent codebook is generated to further improve the performance. Additionally, for the clear representation of the context dependency information in the transient parts, some Gaussians are excluded from class-dependent codebook. The proposed method leads to a lip-sync system that performs at a level that is similar to previous designs based on HBT and continuous hidden Markov models (CHMMs). However, our method reduces the number of model parameters by one-third and enables real-time operation.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleReal-Time Continuous Phoneme Recognition System Using Class-Dependent Tied-Mixture HMM With HBT Structure for Speech-Driven Lip-Sync-
dc.typeArticle-
dc.contributor.affiliatedAuthorKo, Hanseok-
dc.identifier.doi10.1109/TMM.2008.2004908-
dc.identifier.scopusid2-s2.0-56549088313-
dc.identifier.wosid000261310700007-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON MULTIMEDIA, v.10, no.7, pp.1299 - 1306-
dc.relation.isPartOfIEEE TRANSACTIONS ON MULTIMEDIA-
dc.citation.titleIEEE TRANSACTIONS ON MULTIMEDIA-
dc.citation.volume10-
dc.citation.number7-
dc.citation.startPage1299-
dc.citation.endPage1306-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryComputer Science, Software Engineering-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordAuthorHead-body-tail HMM-
dc.subject.keywordAuthorphoneme recognition-
dc.subject.keywordAuthorreal-time lip-sync-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > School of Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Ko, Han seok photo

Ko, Han seok
공과대학 (전기전자공학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE