STFT와 RNN을 활용한 화자 인증 모델

김민서; 문종섭

doi:10.13089/JKIISC.2019.29.6.1393

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

STFT와 RNN을 활용한 화자 인증 모델Speaker Verification Model Using Short-Time Fourier Transform and Recurrent Neural Network

Other Titles: Speaker Verification Model Using Short-Time Fourier Transform and Recurrent Neural Network

Authors: 김민서; 문종섭

Issue Date: 2019

Publisher: 한국정보보호학회

Keywords: Speaker verification; STFT; Deep Learning; Recurrent Neural Network(RNN)

Citation: 정보보호학회논문지, v.29, no.6, pp.1393 - 1401

Indexed: KCI

Journal Title: 정보보호학회논문지

Volume: 29

Number: 6

Start Page: 1393

End Page: 1401

URI: https://scholar.korea.ac.kr/handle/2021.sw.korea/70056

DOI: 10.13089/JKIISC.2019.29.6.1393

ISSN: 1598-3986

Abstract: 최근 시스템에 음성 인증 기능이 탑재됨에 따라 화자(Speaker)를 정확하게 인증하는 중요성이 높아지고 있다. 이에 따라 다양한 방법으로 화자를 인증하는 모델이 제시되어 왔다. 본 논문에서는 Short-time Fouriertransform(STFT)를 적용한 새로운 화자 인증 모델을 제안한다. 이 모델은 기존의 Mel-Frequency CepstrumCoefficients(MFCC) 추출 방법과 달리 윈도우 함수를 약 66.1% 오버랩하여 화자 인증 시 정확도를 높일 수 있다. 새로운 화자 인증 모델을 제안한다. 이 때, LSTM 셀을 적용한 Recurrent Neural Network(RNN)라는 딥러닝 모델을 사용하여 시변적 특징을 가지는 화자의 음성 특징을 학습하고, 정확도가 92.8%로 기존의 화자 인증 모델보다 5.5% 정확도가 높게 측정되었다.

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Science and Technology > Department of Electronics and Information Engineering > 1. Journal Articles

Show full item record

qrcode

Altmetrics

Total Views & Downloads

STATISTICS: Total View :7,330,875; Today View :5,735

RSS_1.0 RSS_2.0 ATOM_1.0

145 Anam-ro, Seongbuk-gu, Seoul, 02841, Korea+82-2-3290-2963

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE