화자 임베딩과 발화 리듬의 연관관계에 대한 연구Does the Speaker Embedding Encode Speech Rhythm?
- Other Titles
- Does the Speaker Embedding Encode Speech Rhythm?
- Authors
- 김서현; 남호성
- Issue Date
- 2021
- Publisher
- 한국외국어대학교 외국어교육연구소
- Keywords
- Computer Assisted Pronunciation Training; Speaker Embedding; Voice Conversion; 음성 변환; 컴퓨터 보조 발음 학습; 화자 임베딩
- Citation
- 외국어교육연구, v.35, no.2, pp.131 - 144
- Indexed
- KCI
- Journal Title
- 외국어교육연구
- Volume
- 35
- Number
- 2
- Start Page
- 131
- End Page
- 144
- URI
- https://scholar.korea.ac.kr/handle/2021.sw.korea/137973
- DOI
- 10.16933/sfle.2021.35.2.131
- ISSN
- 1225-4975
- Abstract
- The present study investigates if speech rhythm is encoded in the utterance-level speaker embedding which is an averaged value of frame-level speaker embeddings. When speaker encoders are used in Computer Assisted Pronunciation Training, finding what information is included in speaker embeddings is crucial because it defines what feature a learner should acquire to be fluent speaker. Rhythm has been regarded as a speaker identifiable feature. The speaker embeddings, however, may fail to capture rhythm features since the temporal dependency of prosody is likely to be lost by simple averaging. To quantify the degree to which rhythm information is encoded in the speaker embedding, the speaker embeddings were projected to the feature space by least square linear regression. The R-squared values for the rhythm features were consistently low across the models with the different number of parameters, in contrast to the acoustic features which showed the significantly high R-squared values. The result indicates that the utterance-mean embeddings did not encode speech rhythm of individual speaker. Based on the result, the way to better adopt speaker embeddings in CAPT system is discussed.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Liberal Arts > Department of English Language and Literature > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.