Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices

Leem, Seong-Gyun; Yoo, In-Chul; Yook, Dongsuk

doi:10.1109/TCE.2019.2899067

Full metadata record

DC Field	Value	Language
dc.contributor.author	Leem, Seong-Gyun	-
dc.contributor.author	Yoo, In-Chul	-
dc.contributor.author	Yook, Dongsuk	-
dc.date.accessioned	2021-09-01T15:51:58Z	-
dc.date.available	2021-09-01T15:51:58Z	-
dc.date.created	2021-06-19	-
dc.date.issued	2019-05	-
dc.identifier.issn	0098-3063	-
dc.identifier.uri	https://scholar.korea.ac.kr/handle/2021.sw.korea/65922	-
dc.description.abstract	Speech-based interfaces are convenient and intuitive and are therefore strongly preferred for human-computer interaction with Internet of Things (IoT) devices. Pre-defined keywords are typically used as triggers that notify a device to accept the voice commands that follow. Keyword spotting techniques used as voice trigger mechanisms typically model the target keyword via triphone models and non-keywords through single-state filler models. Recently, deep neural networks (DNNs) have outperformed hidden Markov models with Gaussian mixture models in various tasks, including speech recognition. However, conventional DNN-based keyword spotting methods cannot easily change the target keywords, an essential feature for speech-based IoT device interfaces. Additionally, the increase in computational requirements hinders the use of complex filler models in DNN-based keyword spotting systems, which diminishes the accuracy of such systems. In this paper, we propose a novel DNN-based keyword spotting system that can alter the keyword on the fly and uses both triphone and monophone acoustic models to reduce computational complexity and increase generalization performance. Experimental results on the FFMTIMIT corpus show that the proposed method reduced the error rate by 36.6%.	-
dc.language	English	-
dc.language.iso	en	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Yoo, In-Chul	-
dc.contributor.affiliatedAuthor	Yook, Dongsuk	-
dc.identifier.doi	10.1109/TCE.2019.2899067	-
dc.identifier.scopusid	2-s2.0-85061546748	-
dc.identifier.wosid	000466181000008	-
dc.identifier.bibliographicCitation	IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, v.65, no.2, pp.188 - 194	-
dc.relation.isPartOf	IEEE TRANSACTIONS ON CONSUMER ELECTRONICS	-
dc.citation.title	IEEE TRANSACTIONS ON CONSUMER ELECTRONICS	-
dc.citation.volume	65	-
dc.citation.number	2	-
dc.citation.startPage	188	-
dc.citation.endPage	194	-
dc.type.rims	ART	-
dc.type.docType	Article	-
dc.description.journalClass	1	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.subject.keywordAuthor	Deep neural network	-
dc.subject.keywordAuthor	keyword spotting	-
dc.subject.keywordAuthor	multitask learning	-

Files in This Item: There are no files associated with this item.

Appears in Collections: Graduate School > Department of Computer Science and Engineering > 1. Journal Articles
