Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

화자 임베딩과 발화 리듬의 연관관계에 대한 연구Does the Speaker Embedding Encode Speech Rhythm?

Other Titles
Does the Speaker Embedding Encode Speech Rhythm?
Authors
김서현남호성
Issue Date
2021
Publisher
한국외국어대학교 외국어교육연구소
Keywords
Computer Assisted Pronunciation Training; Speaker Embedding; Voice Conversion; 음성 변환; 컴퓨터 보조 발음 학습; 화자 임베딩
Citation
외국어교육연구, v.35, no.2, pp.131 - 144
Indexed
KCI
Journal Title
외국어교육연구
Volume
35
Number
2
Start Page
131
End Page
144
URI
https://scholar.korea.ac.kr/handle/2021.sw.korea/137973
DOI
10.16933/sfle.2021.35.2.131
ISSN
1225-4975
Abstract
The present study investigates if speech rhythm is encoded in the utterance-level speaker embedding which is an averaged value of frame-level speaker embeddings. When speaker encoders are used in Computer Assisted Pronunciation Training, finding what information is included in speaker embeddings is crucial because it defines what feature a learner should acquire to be fluent speaker. Rhythm has been regarded as a speaker identifiable feature. The speaker embeddings, however, may fail to capture rhythm features since the temporal dependency of prosody is likely to be lost by simple averaging. To quantify the degree to which rhythm information is encoded in the speaker embedding, the speaker embeddings were projected to the feature space by least square linear regression. The R-squared values for the rhythm features were consistently low across the models with the different number of parameters, in contrast to the acoustic features which showed the significantly high R-squared values. The result indicates that the utterance-mean embeddings did not encode speech rhythm of individual speaker. Based on the result, the way to better adopt speaker embeddings in CAPT system is discussed.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Liberal Arts > Department of English Language and Literature > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Nam, Ho sung photo

Nam, Ho sung
문과대학 (영어영문학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE