Language:

Differentially Private Speaker Anonymization

arXiv.org, 2023-01 [Peer Reviewed Journal]

2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. ;http://creativecommons.org/licenses/by/4.0 ;EISSN: 2331-8422 ;DOI: 10.48550/arxiv.2202.11823

Full text available

Citations Cited by

Actions
1. Add to My Research
2. Remove from My Research
3. E-mail
4. Print
5. Permalink
6. Citation
7. EasyBib
8. EndNote
9. RefWorks
10. Delicious
11. Export RIS
12. Export BibTeX

Title:
Differentially Private Speaker Anonymization
Author: Ali Shahin Shamsabadi ; Brij Mohan Lal Srivastava ; Bellet, Aurélien ; Vauquier, Nathalie ; Vincent, Emmanuel ; Maouche, Mohamed ; Tommasi, Marc ; Papernot, Nicolas
Subjects: Automatic speech recognition ; Computer Science - Cryptography and Security ; Computer Science - Learning ; Computer Science - Sound ; Embedding ; Extractors ; Feature extraction ; Linguistics ; Privacy ; Speech ; Speech recognition ; Training ; Upper bounds
Is Part Of: arXiv.org, 2023-01
Description: Sharing real-world speech utterances is key to the training and deployment of voice-based services. However, it also raises privacy risks as speech contains a wealth of personal data. Speaker anonymization aims to remove speaker information from a speech utterance while leaving its linguistic and prosodic attributes intact. State-of-the-art techniques operate by disentangling the speaker information (represented via a speaker embedding) from these attributes and re-synthesizing speech based on the speaker embedding of another speaker. Prior research in the privacy community has shown that anonymization often provides brittle privacy protection, even less so any provable guarantee. In this work, we show that disentanglement is indeed not perfect: linguistic and prosodic attributes still contain speaker information. We remove speaker information from these attributes by introducing differentially private feature extractors based on an autoencoder and an automatic speech recognizer, respectively, trained using noise layers. We plug these extractors in the state-of-the-art anonymization pipeline and generate, for the first time, private speech utterances with a provable upper bound on the speaker information they contain. We evaluate empirically the privacy and utility resulting from our differentially private speaker anonymization approach on the LibriSpeech data set. Experimental results show that the generated utterances retain very high utility for automatic speech recognition training and inference, while being much better protected against strong adversaries who leverage the full knowledge of the anonymization process to try to infer the speaker identity.
Publisher: Ithaca: Cornell University Library, arXiv.org
Language: English
Identifier: EISSN: 2331-8422
DOI: 10.48550/arxiv.2202.11823
Source: arXiv.org
AUTh Library subscriptions: ProQuest Central
ROAD
Open Access: Freely Accessible Journals by multiple vendors

Back to results list

Previous Result 13 Next Go to Next page


INSPIRE LIBRARY - TON DUC THANG UNIVERSITY	(84-028) 37 755 057	Feedback
19 Nguyen Huu Tho St. Dist.7, HCM	thuvien@tdtu.edu.vn	Feedback

Differentially Private Speaker Anonymization

Searching Remote Databases, Please Wait