Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Speaker Anonymization for Personal Information Protection Using Voice Conversion Techniques

Full metadata record
DC Field Value Language
dc.contributor.authorYoo, In-Chul-
dc.contributor.authorLee, Keonnyeong-
dc.contributor.authorLeem, Seonggyun-
dc.contributor.authorOh, Hyunwoo-
dc.contributor.authorKo, Bonggu-
dc.contributor.authorYook, Dongsuk-
dc.date.accessioned2021-08-31T16:10:14Z-
dc.date.available2021-08-31T16:10:14Z-
dc.date.created2021-06-18-
dc.date.issued2020-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholar.korea.ac.kr/handle/2021.sw.korea/59030-
dc.description.abstractAs speech-based user interfaces integrated in the devices such as AI speakers become ubiquitous, a large amount of user voice data is being collected to enhance the accuracy of speech recognition systems. Since such voice data contain personal information that can endanger the privacy of users, the issue of privacy protection in the speech data has garnered increasing attention after the introduction of the General Data Protection Regulation in the EU, which implies that restrictions and safety measures for the use of speech data become essential. This study aims to filter the speaker-related voice biometrics present in speech data such as voice fingerprint without altering the linguistic content to preserve the usefulness of the data while protecting the privacy of users. To achieve this, we propose an algorithm that produces anonymized speeches by adopting many-to-many voice conversion techniques based on variational autoencoders (VAEs) and modifying the speaker identity vectors of the VAE input to anonymize the speech data. We validated the effectiveness of the proposed method by measuring the speaker-related information and the original linguistic information retained in the resultant speech, using an open source speaker recognizer and a deep neural network-based automatic speech recognizer, respectively. Using the proposed method, the speaker identification accuracy of the speech data was reduced to 0.1-9.2%, indicating successful anonymization, while the speech recognition accuracy was maintained as 78.2-81.3%.-
dc.languageEnglish-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleSpeaker Anonymization for Personal Information Protection Using Voice Conversion Techniques-
dc.typeArticle-
dc.contributor.affiliatedAuthorYoo, In-Chul-
dc.contributor.affiliatedAuthorYook, Dongsuk-
dc.identifier.doi10.1109/ACCESS.2020.3035416-
dc.identifier.scopusid2-s2.0-85102826997-
dc.identifier.wosid000589738100001-
dc.identifier.bibliographicCitationIEEE ACCESS, v.8, pp.198637 - 198645-
dc.relation.isPartOfIEEE ACCESS-
dc.citation.titleIEEE ACCESS-
dc.citation.volume8-
dc.citation.startPage198637-
dc.citation.endPage198645-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordAuthorData privacy-
dc.subject.keywordAuthordeep neural networks-
dc.subject.keywordAuthorspeaker anonymization-
dc.subject.keywordAuthorvariational autoencoder-
dc.subject.keywordAuthorvoice conversion-
Files in This Item
There are no files associated with this item.
Appears in
Collections
Graduate School > Department of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE