Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

화자 임베딩과 발화 리듬의 연관관계에 대한 연구

Full metadata record
DC Field Value Language
dc.contributor.author김서현-
dc.contributor.author남호성-
dc.date.accessioned2022-03-06T10:40:36Z-
dc.date.available2022-03-06T10:40:36Z-
dc.date.created2022-02-10-
dc.date.issued2021-
dc.identifier.issn1225-4975-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/137973-
dc.description.abstractThe present study investigates if speech rhythm is encoded in the utterance-level speaker embedding which is an averaged value of frame-level speaker embeddings. When speaker encoders are used in Computer Assisted Pronunciation Training, finding what information is included in speaker embeddings is crucial because it defines what feature a learner should acquire to be fluent speaker. Rhythm has been regarded as a speaker identifiable feature. The speaker embeddings, however, may fail to capture rhythm features since the temporal dependency of prosody is likely to be lost by simple averaging. To quantify the degree to which rhythm information is encoded in the speaker embedding, the speaker embeddings were projected to the feature space by least square linear regression. The R-squared values for the rhythm features were consistently low across the models with the different number of parameters, in contrast to the acoustic features which showed the significantly high R-squared values. The result indicates that the utterance-mean embeddings did not encode speech rhythm of individual speaker. Based on the result, the way to better adopt speaker embeddings in CAPT system is discussed.-
dc.languageEnglish-
dc.language.isoen-
dc.publisher한국외국어대학교 외국어교육연구소-
dc.title화자 임베딩과 발화 리듬의 연관관계에 대한 연구-
dc.title.alternativeDoes the Speaker Embedding Encode Speech Rhythm?-
dc.typeArticle-
dc.contributor.affiliatedAuthor남호성-
dc.identifier.doi10.16933/sfle.2021.35.2.131-
dc.identifier.bibliographicCitation외국어교육연구, v.35, no.2, pp.131 - 144-
dc.relation.isPartOf외국어교육연구-
dc.citation.title외국어교육연구-
dc.citation.volume35-
dc.citation.number2-
dc.citation.startPage131-
dc.citation.endPage144-
dc.type.rimsART-
dc.identifier.kciidART002719268-
dc.description.journalClass2-
dc.description.journalRegisteredClasskci-
dc.subject.keywordAuthorComputer Assisted Pronunciation Training-
dc.subject.keywordAuthorSpeaker Embedding-
dc.subject.keywordAuthorVoice Conversion-
dc.subject.keywordAuthor음성 변환-
dc.subject.keywordAuthor컴퓨터 보조 발음 학습-
dc.subject.keywordAuthor화자 임베딩-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Liberal Arts > Department of English Language and Literature > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Nam, Ho sung photo

Nam, Ho sung
문과대학 (영어영문학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE