Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Leem, Seong-Gyun | - |
dc.contributor.author | Yoo, In-Chul | - |
dc.contributor.author | Yook, Dongsuk | - |
dc.date.accessioned | 2021-09-01T15:51:58Z | - |
dc.date.available | 2021-09-01T15:51:58Z | - |
dc.date.created | 2021-06-19 | - |
dc.date.issued | 2019-05 | - |
dc.identifier.issn | 0098-3063 | - |
dc.identifier.uri | https://scholar.korea.ac.kr/handle/2021.sw.korea/65922 | - |
dc.description.abstract | Speech-based interfaces are convenient and intuitive, and therefore, strongly preferred by Internet of Things (IoT) devices for human-computer interaction. Pre-defined keywords are typically used as a trigger to notify devices for inputting the subsequent voice commands. Keyword spotting techniques used as voice trigger mechanisms, typically model the target keyword via triphone models and non-keywords through single-state filler models. Recently, deep neural networks (DNNs) have shown better performance compared to hidden Markov models with Gaussian mixture models, in various tasks including speech recognition. However, conventional DNN-based keyword spotting methods cannot change the target keywords easily, which is an essential feature for speech-based IoT device interface. Additionally, the increase in computational requirements interferes with the use of complex filler models in DNN-based keyword spotting systems, which diminishes the accuracy of such systems. In this paper, we propose a novel DNN-based keyword spotting system that alters the keyword on the fly and utilizes triphone and monophone acoustic models in an effort to reduce computational complexity and increase generalization performance. The experimental results using the FFMTIMIT corpus show that the error rate of the proposed method was reduced by 36.6%. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Yoo, In-Chul | - |
dc.contributor.affiliatedAuthor | Yook, Dongsuk | - |
dc.identifier.doi | 10.1109/TCE.2019.2899067 | - |
dc.identifier.scopusid | 2-s2.0-85061546748 | - |
dc.identifier.wosid | 000466181000008 | - |
dc.identifier.bibliographicCitation | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, v.65, no.2, pp.188 - 194 | - |
dc.relation.isPartOf | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | - |
dc.citation.title | IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | - |
dc.citation.volume | 65 | - |
dc.citation.number | 2 | - |
dc.citation.startPage | 188 | - |
dc.citation.endPage | 194 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Telecommunications | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Telecommunications | - |
dc.subject.keywordAuthor | Deep neural network | - |
dc.subject.keywordAuthor | keyword spotting | - |
dc.subject.keywordAuthor | multitask learning | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(02841) 서울특별시 성북구 안암로 14502-3290-1114
COPYRIGHT © 2021 Korea University. All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.