Storytelling Voice Conversion: Evaluation Experiment Using Gaussian Mixture Models

In the development of the voice conversion and personification of the text-to-speech (TTS) systems, it is very necessary to have feedback information about the users’ opinion on the resulting synthetic speech quality. Therefore, the main aim of the experiments described in this paper was to find out whether the classifier based on Gaussian mixture models (GMM) could be applied for evaluation of different storytelling voices created by transformation of the sentences generated by the Czech and Slovak TTS system. We suppose that it is possible to combine this GMM-based statistical evaluation with the classical one in the form of listening tests or it can replace them. The results obtained in this way were in good correlation with the results of the conventional listening test, so they confirm practical usability of the developed GMM classifier. With the help of the performed analysis, the optimal setting of the initial parameters and the structure of the input feature set for recognition of the storytelling voices was finally determined.

eISSN:: 1339-309X
Language:: English

Publication timeframe:: 6 times per year
Journal Subjects:: Engineering, Introductions and Overviews, other

Journal RSS Feed

Storytelling Voice Conversion: Evaluation Experiment Using Gaussian Mixture Models

Published Online: Sep 19, 2015

Page range: 194 - 202

Received: Dec 01, 2014

DOI: https://doi.org/10.2478/jee-2015-0032

Keywords
storytelling voice conversion, spectral and prosodic features of speech, evaluation of speech quality, GMM classifier

© Faculty of Electrical Engineering and Information Technology, Slovak University of Technology

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Storytelling Voice Conversion: Evaluation Experiment Using Gaussian Mixture Models

Published Online: Sep 19, 2015

Page range: 194 - 202

Received: Dec 01, 2014

DOI: https://doi.org/10.2478/jee-2015-0032

Keywordsstorytelling voice conversion, spectral and prosodic features of speech, evaluation of speech quality, GMM classifier

© Faculty of Electrical Engineering and Information Technology, Slovak University of Technology

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Keywords
storytelling voice conversion, spectral and prosodic features of speech, evaluation of speech quality, GMM classifier