Evaluation of speaker de-identification based on voice gender and age conversion

This paper addresses two tasks. The first is the design and practical testing of a new voice de-identification method that changes the apparent age and/or gender of a speaker by multi-segmental frequency scale transformation combined with prosody modification. The second is to verify whether a classifier based on Gaussian mixture models (GMM) can detect the original Czech and Slovak speakers after voice de-identification has been applied. The experiments confirm that the developed gender and age conversion works for all selected types of de-identification, which can be objectively evaluated by the GMM-based open-set classifier. The original-speaker detection accuracy was also compared for sentences uttered by German and English speakers, indicating that the proposed method is language independent.
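The open-set decision mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature vectors, the model parameters, the labels and the rejection threshold below are all hypothetical, and the GMMs are assumed to have diagonal covariances and to be already trained. The sketch only shows the general idea of scoring an utterance against each enrolled speaker model and rejecting it as "unknown" when even the best score is too low.

```python
import math


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    component_logs = []
    for w, mu, var in zip(weights, means, variances):
        log_p = math.log(w)
        for xi, mi, vi in zip(x, mu, var):
            log_p += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
        component_logs.append(log_p)
    # Log-sum-exp over the mixture components for numerical stability.
    m = max(component_logs)
    return m + math.log(sum(math.exp(c - m) for c in component_logs))


def classify_open_set(frames, models, threshold):
    """Return (label, scores); label is None when the utterance is rejected.

    frames    -- list of feature vectors for one utterance
    models    -- dict: label -> (weights, means, variances), assumed pre-trained
    threshold -- hypothetical minimum average log-likelihood for acceptance
    """
    scores = {}
    for label, (weights, means, variances) in models.items():
        total = sum(gmm_log_likelihood(f, weights, means, variances) for f in frames)
        scores[label] = total / len(frames)
    best = max(scores, key=scores.get)
    return (best if scores[best] >= threshold else None), scores


# Toy two-dimensional models; all values are made up for illustration.
MODELS = {
    "speaker_A": ([0.5, 0.5], [[0.0, 0.0], [1.0, 1.0]], [[1.0, 1.0], [1.0, 1.0]]),
    "speaker_B": ([1.0], [[5.0, 5.0]], [[1.0, 1.0]]),
}

if __name__ == "__main__":
    label, _ = classify_open_set([[0.5, 0.5], [0.4, 0.6]], MODELS, threshold=-10.0)
    print("close to model A:", label)
    label, _ = classify_open_set([[20.0, 20.0]], MODELS, threshold=-10.0)
    print("far from all models:", label)
```

In a de-identification experiment of this kind, a converted utterance that is still assigned to its original speaker would count as a detection, while a rejection or a different label would suggest that the conversion hid the speaker's identity from this classifier.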

eISSN: 1339-309X
Language: English

Publication timeframe: 6 times per year
Journal Subjects: Engineering, Introductions and Overviews, other

Published Online: May 30, 2018

Page range: 138 - 147

Received: Nov 14, 2017

DOI: https://doi.org/10.2478/jee-2018-0017

Keywords
GMM classifier, spectral and prosodic features of speech, speaker gender and age classification

© 2018 Jiří Přibil et al., published by Sciendo

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
